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Description 

PURIFIED AND ISOLATED piwi FAMILY GENES AND GENE PRODUCTS 
AND METHODS EMPLOYING SAME 



Application 60/110.091, filed December 4, 1998, the disclosure of which is 
incorporated herein by reference in its entirety. 



10 Government has certain rights in the invention. 

Technical Field 

The present invention relates generally to isolated and purified proteins 
and nucleic acids which modulate stem cell renewal, growth and division and 
which modulate primordial germ cell proliferation. More particulariy, the 
15 present invention relates to isolated and purified p/vW family proteins and 
isolated and purified polynucleic acids encoding the same. 

The publications and other materials used herein to illuminate the 
background of the invention, and in particular cases, to provide additional 
details respecting the practice, are incorporated herein by reference, and for 
20 convenience, are referenced by author and year of publication in the following 
text, and respectively grouped by author in the appended list of references. 

Table of Abbreviations 



5 



Prioritv ApDlication Information 
This application claims the benefit of United States Provisional Patent 
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This work was supported by NIH grant HD 33760. The U.S. 



APC 



apical polar ceil(s) 
American Type Culture Collection 
basic fibroblast growth fector 
buffalo rat liver 



ATCC 



25 



bFGF 



BRL 



BSA 



bovine serum albumin 



Cb 



cystoblast 
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CDR(s) complementarity determining region(s) 

COOH carboxy temninus 

CyO a strain of Dmsophila 

DAPI a DNA specific dye 

5 DIVIEI^^ Dulbecco's modified Eagle's medium 

DC differentiating 16-cell cysts 

DTC distal tip cell 

ECL enhanced chemiluminescent 

EDTA ethylenediamine tetraacetate 

10 ES epithetial sheath 

EST expressed sequence tags 

FBS fetal bovine serum 

FC follicle cell 

FRT^OA a strain of Dmsophila 

15 Fu fusome 

GC-MS gas chromatography-mass spectroscopy 

Ge germarium(ia) 

GSC(s) gennline stem cell(s) 

HAT cell culture media comprising hypoxanthine, 

20 aminopterin, and thymidine 

hiwi/H\\N\ a human homolog of the p/ty//PI Wl gene and gene 
product 

HPLC high pressure liquid chromatography 

hsGal4 a heat shock inducible transgene 

25 IS inner sheath 

KLH keyhole limpet hemocyanin 

MC mitotically active cysts 

m/WMIWI a murine homolog of the p/m7PIWI gene and gene 
product 

30 MPZ mitotic proliferation zone 

myc human oncogene used herein as molecular tag for 
piwi 
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10 



NH2 

ORF 

PGR 

PGC(s) 

pAy//PIWI 

RACE 
RNAi 

SSC(s) 

Sp 

TFC 

UTR 

VASA 

WT 



amino terminus 
open reading frame 
polymerase ciiain reaction 
primordial germ cell(s) 

a gene and gene product isolated from Drosophila 

and having stem cell division modulating activity 

rapid amplification of cDNA ends 

RNA interference assay 

somatic stem cell(s) 

spectrosome 

temiinal filament cells 

untranslated region 

a gennllne specific marker 

wild type 



15 Background Art 

Stem cells are a very small number of founder cells that play a central 
role in tissue development and maintenance. In human bodies, stem cells are 
responsible for generating and/or maintaining approximately 90% of cells in the 
adult tissues. Over-proliferation of malignant stem cells is the leading cause 

20 of cancer while under-proliferation of stem cells or stem-like progenitor cells 
leads to tissue dystrophy, anemia, immunodeficiency, and male infertility. The 
crucial role of stem cells has long been attributed to their ability to self-renew 
and to generate immense number of specialized cells on demand. 

The ability of stem cells to self-renew and to produce a large number of 

25 differentiated progeny is critical forthe development and maintenance of a wide 
variety of tissues in organisms ranging from insects to mammals (reviewed in 
Potten, 1997; Lin. 1997; Lin and Schagat, 1997; Monison et ah, 1997). This 
self-renewing ability is controlled both by extrinsic signaling and by cell- 
autonomous mechanisms (reviewed in Monison et al., 1997; Lin and Schagat, 

30 1 997). Cell autonomous mechanisms have been elucidated in a few stem cell 
models such as neuroblasts and germline stem cells in Drosophila (Lin and 
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Schagat, 1997; Deng and Lin, 1997), whereas the role of extrinsic signaling 
has been elucidated in several systems. For example, the proliferation and 
difTerentiation of mammalian stem cells In the hematopoietic, epidermal, and 
nervous systems depend on extrinsic signals that act on specific receptors on 
5 the stem cell surface (Monison et a!., 1 997). 

In diverse organisms ranging from invertebrates to mammals, the 
proliferation of gemi cells, some of which possess stem cell properties, has 
been postulated, and, in some cases, shown to be regulated by neighboring 
non-mitotic somatic cells (Lin, 1997). Parllculariy, in C. elegans, cell-cell 

10 interactions between the somatic distal tip cell (DTC) at the end of each 
gonadal ami and the underiying mitotic germline nuclei via the lag-2/glp-1 
signaling pathway provides a paradigm forsoma-germline interaction (reviewed 
in Kimble and Simpson, 1997). The glp-1 pathway is required to maintain a 
population of mitotically active nuclei in the germline. 

15 However, few molecules and/or mechanisms identified in a particular 

type of stem ceils have been shown to be applicable to other stem cell 
systems. For example, the glp-l equivalent pathway in Drosophila does not 
play a role in regulating GSC division and maintenance (Ruohala et aL, 1991 ; 
XuetaL,1992). 

20 The self-renewing asymmetric division of GSCs In the Drosophila ovary 

is known to be controlled both by an intracellular mechanism (Deng and Lin, 
1997) and by cell-cell interactions (Lin and Spradiing, 1993). The intracellular 
mechanism involves a cytoplasmic organelle temned the spectrosome that 
controls the orientation of GSC division (Lin et al., 1994; Deng and Lin, 1997). 

25 The cell-cell interaction mechanism involves temiinal filament cells, as shown 
by laser ablation studies (Lin and Spradiing, 1993). Recently, dpp has been 
shown as a key signaling molecule required for GSC division and maintenance 
(Xie and Spradiing, 1998). It is possible that the dpp signal emanates from 
somatic cells. Alternatively, dpp signal may originate from the germline or even 

30 within GSCs, like its mammalian homologs (Zhao et al., 1996). 

In mammals, primordial gemr! cells cultured from the genital ridge have 
the ability to give rise to pluripotent embryonic stem cells. For example. U.S. 
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Patent No. 5,690,926 issued November 25, 1997 to Hooan: U.S. Patent No. 
5.670,372 issued September 23, 1997 to Hogan: and U.S. Patent No. 
5,537,357 issued September 26, 1995 to Hogan each disclose pluripotentlal 
mammalian embryonic stem cells and methods of making the same. The 
5 disclosure of these patents is limited to mammalian embryonic stem ceils and 
particularly to the culturing of murine and other mammalian embryonic stem 
cells using a combination of growth factors consisting of SCF, FGF and LIF. 

Cunrent prior art reports on the culture of avian primordial genm cells 
(PGCs) have concentrated on efforts to maintain a PGC-phenotype and to 

10 stimulate proliferation. See e.g. . Chang. I. K, et aL . CelL Biol. Int 1997 Aug; 
21(8): 495-9; Chang, i. K. et al. . Cell. BioL Int 1995 Feb; 19(2): 143-9; Allioli, 
N. et a!.. Dev. BioL 1994 Sep; 165(1): 30-7 and PCT Publication No. WO 
99/06533, published February 11, 1999 (Applicant - University of 
Massachusetts; Inventors - Ponce de Leon et al. ). 

15 As illustrated above, numerous attempts have been devoted to identify 

genes that control the self-renewing ability of stem cells or the proliferation of 
primordial germ cells. As a result, a number of growth factors and signaling 
molecules, such as Sfee/ factor and its c-kit receptor, have been identified to 
regulate such activity in certain tissues. Despite this progress, there remains 

20 a long-felt and continuing need to identify genes that play a role in modulating 
the growth and self-renewing division of stem cells, particulariy GSCs, and that 
play a role in modulating proliferation of primordial gem cells. 

SMmmary of thp Inventign 
The present invention contemplates an isolated and purified family of 
25 genes and gene products (the piwi family) which plays a role in the growth, 
proliferation and self-renewing division of stem cells, and proliferation of 
primordial gemn cells. More preferably, a polypeptide of the invention is a 
recombinant polypeptide. Even more preferably, a polypeptide of the present 
invention comprises a vertebrate piwi family polypeptide. Even more 
30 preferably, a polypeptide of the present invention comprises a mammalian piwi 
family polypeptide. Even more preferably, a polypeptide of the present 
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invention comprises a human p/w/ family polypeptide. Even more preferably, 
a polypeptide of the present invention comprises an amino acid sequence from 
the amino acid residue sequences of any of SEQ ID N0s:2, 4 and 6. 

The present invention also provides an isolated and purified 
5 polynucleotide that encodes a polypeptide that plays a role in the growth, 
proliferation and self-renewing division of stem cells, and proliferation of 
primordial gemn cells. In a preferred embodiment, a polynucleotide of the 
present invention comprises a DNA molecule from a vertebrate species. 
Prefenned vertebrates comprise mammals, birds or fish. A preferred mammal 

10 is a human. More preferably, a polynucleotide of the present invention 
encodes a polypeptide designated PIWI. Even more preferably, a 
polynucleotide of the present invention encodes a polypeptide comprising an 
amino acid residue sequence of any of SEQ ID N0s:2, 4 and 6. Most 
preferably, an isolated and purified polynucleotide of the invention comprises 

1 5 a nucleotide base sequence of any of SEQ ID NOs:1 , 3 and 5. 

In another embodiment, the present invention contemplates an antibody 
immunoreactive with a p/w/ family polypeptide as described above. SEQ ID 
NOs:3-6 set forth nucleotide and amino acid sequences from representative 
vertebrates, human and mouse. Also contemplated by the present invention 

20 are antibodies immunoreactive with homologues or biologically equivalent piwi 
family polynucleotides and polypeptides found in other vertebrates. Preferably, 
an antibody of the invention is a monoclonal antibody. More preferably, the 
p/w/ family polypeptide comprises human PIWI (HIWI). Even more preferably, 
the piwi polypeptide comprises an amino acid residue sequence of any of SEQ 

25 IDNOs:2,4and6. 

In another aspect, the present invention contemplates a method of 
producing an antibody immunoreactive with a piwi family polypeptide as 
described above, the method comprising the steps of (a) transfecting a 
recombinant host cell with a polynucleotide that encodes a biologically active 

30 p/w/ family polypeptide; (b) culturing the host cell under conditions sufficient for 
expression of the polypeptide; (c) recovering the polypeptide; and (d) preparing 
the antit>ody to the polypeptide. SEQ ID N0s:3-6 set forth nucleotide and 
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amino acid sequences from representative vertebrates, liuman and mouse. 
Preferably, the host cell is transfected with a polynucleotide of any of SEQ ID 
N0s:1, 3 and 5. Even more preferably, the present invention provides an 
antibody prepared according to the method described above. Also 
5 contemplated by the present invention is the use of homologues or biologically 
equivalent polynucleotides and polypeptides found in other vertebrates to 
produce antibodies. 

Alternatively, the present invention provides a method of detecting a 
piwi family polypeptide as described above, wherein the method comprises 

10 immunoreacting the polypeptide with an antibody prepared according to the 
method described above to form an antibody-polypeptide conjugate, and 
detecting the conjugate. 

In yet another embodiment, the present invention contemplates a 
method of detecting a messenger RNA transcript that encodes a piwi family 

1 5 polypeptide as described above, wherein the method comprises hybridizing the 
messenger RNA transcript with a polynucleotide sequence that encodes that 
polypeptide to form a duplex; and detecting the duplex. Alternatively, the 
present invention provides a method of detecting a DNA molecule that encodes 
a p/w/ family polypeptide as described above, wherein the method comprises 

20 hybridizing DNA molecules with a polynucleotide that encodes a biologically 
active p/w/ family polypeptide to form a duplex; and detecting the duplex. 

In another aspect, the present invention contemplates an assay kit for 
detecting the presence of a piwi family polypeptide in a biological sample, 
where the kit comprises a first container containing a first antibody capable of 

25 immunoreacting with a biologically active piwi polypeptide, with the first 
antibody. Preferably, the first antibody is present in an amount sufficient to 
perfomi at least one assay. Also preferably, an assay kit of the invention 
further comprises a second container containing a second antibody that 
immunoreacts with the first antibody. More preferably, the antibodies used in 

30 an assay kit of the present invention are monoclonal antibodies. Even more 
preferably, the first antibody is affixed to a solid support. More preferably still, 
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the first and second antibodies comprise an Indicator, and, preferably, the 
indicator is a radioactive label, a fluorescent label or an enzyme. 

In an alternative aspect, the present invention provides an assay kit for 
detecting the presence, in biological samples, of a p/W/ family polypeptide, the 
5 kits comprising a first container that contains a second polynucleotide identical 
or complementary to a segment of at least 1 0 contiguous nucleotide bases of 
a polynucleotide that encodes a biologically active p/w/ family polypeptide. 

In another embodiment, the present invention contemplates an assay 
kit for detecting the presence, in a biological sample, of an antibody 
10 immunoreactive with a piwi family polypeptide, the kit comprising a first 
container containing a biologically active piwi family polypeptide that 
immunoreacts with the antibody, with the polypeptide present in an amount 
sufficient to perform at least one assay. 

In still a further embodiment, this invention pertains to transgenic, 
15 therapeutic, screening and culturing methods based upon the self-renewing 
division of stem cells, and proliferation of primordial germ cells function of piwi 
family polynucleotides and polypeptides as described herein. Representative 
therapeutic and culturing methods include administration of a soluble form of 
the piwi protein and gene therapy approaches using an isolated and purified 
20 polynucleotide of the present invention. 

Thus, a key aspect of this invention pertains to the discovery of the 
novel piwi family of proteins and nucleic adds. Preferred nucleic acid and 
amino acid sequences are described in SEQ ID N0s:1-6. 

It is thus another aspect of this invention to provide a purified and 
25 isolated piwifamWy polypeptkle having a role in the growth, proriferation and/or 
self-renewing division of stem cells, and proliferation of primordial genm cells. 

The foregoing aspects and embodiments have broad utility given the 
biological significance of stem cell growth, proliferation and renewal, and 
primordial germ cell proliferation. By way of example, the foregoing aspects 
30 and embodiments are useful in the preparation of screening assays and assay 
kits that are used to identify compounds that affect or modulate piwi family 
biological activity, or that are used to detect the presence of the proteins and 
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nucleic acids of this invention in biological samples. Additionally, it is well 
known that isolated and purified polypeptides have utility as feed additives for 
livestocic and further polynucleotides encoding the polypeptides are thus useful 
in producing the polypeptides. 
5 Some of the aspects and objects of the invention having been stated 

hereinabove, other aspects and objects will become evident as the description 
proceeds, when \aken in connection with the accompanying Examples and 
Drawings as best described hereinbelow. 

Brief Description of the Drawings 

1 0 Figure 1 is a molecular map of the piwi locus. Restriction enzyme map 

for 15 l^b of genomic DNA surrounding the piwi mutations is shown. Three 
transcripts are represented by thick arrows indicating their relative positions, 
sizes, splice sites and direction of transcription. The relative insertion sites of 
the P-element alleles is indicated alx)ve the restriction map with orientation of 

15 insertion indicated by black (5') and white (3') boxes. The P insertions are not 
shown according to the scale, with piwi^ insertion partially mapped. Restriction 
sites: R, EcoR/; H, Hindlll] P. Psth S, Sail', X, Xhol. 

Figure 2 depicts alignment of predicted PIWI protein sequence with its 
homologs in the piwi box domain in 22 different proteins. The alignments were 

20 generated using Block Maker™ analysis (Henlkoff et aL, 1995) against amino 
acids 740-782 of the predicted piwi sequence. The amino acid sequences 
were deduced from genomic DNA sequences for C. elegans (Ce G) or from 
cDNA (EST) sequences (E) for Rattus sp. (Rs) and Oryza sativa (Os). 
GenBank accession numbers and cosmid designations (Ce) are indicated for 

25 each sequence. Residues identical to the consensus are highlighted. Shown 
below the alignments, residues which are absolutely conserved are shown as 
capital letters and residues which are 80% or more conserved are shown as 
lower case letters while dashes indicate their spacing. 

Figure 3 is a graph showing thatp/y-RN/t/significantiy reduces germline 

30 proliferation in C. elegans and that reducing prg activity affects fertility. The 
graph indicates the number of progeny produced by a single F1 worni from an 
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RNAi prg-1 or pBST injected mother vs. the percentage of F1 animals which 
produced that given number of progeny. Comparison of prg-RNAi (striped 
bars) with pBST RNAi (solid bars) animals reveals two distinctive distributions. 

Figure 4 depicts the construction of full length hiwi cDNA clone. The 
5 2.3kb cDNA fragment obtained from library screening (herein named 3-12) was 
subcloned into a pBluescript^ KS^ plasmid vector, the 3-12 insert is shown as 
an open box, the polycloning site on the vector is shown as a dark line. The 
1.4kb cDNA fragment obtained from 5'RACE (herein named 1501) was 
subcloned into a pGEM-T Easy™ vector. The 1501 in pGEM-T Easy™ vector 
1 0 was digested with Sad to coristruct a vector for subsequent ligation. The 3-1 2 
in pBluescript® KS* vector was partially digested with Sad to obtain the insert. 
This partially digested 3-12 insert DNA was used to do ligation with 1501 in 
pGEM-T easy vector to construct a full length hiwi cDNA clone. 

Figure 5 is a schematic depicting the miwi locus (genomic location and 
1 5 gene structure). 

Figure 5A is a schematic depicting mouse chromosome 5. 

Figure 5B Is a schematic depiding the m/w/ genomic locus. 

Figure 5C is a schematic depicting the location and strudure on the miwi 

cDNA. 

20 Figure 5D is a schematic depicting the location and structure of miwi 

genomic clones. 

Figure 6 depicts alignment of MIWI protein sequence with HIWI, 
Drosophiia PIWI and C. elegans PRG-1 and PRG-2. Names of the sequences 
are shown on the right and the positions of them are shown on the left. Amino 

25 acid residues klenticai among above four genes are boxed in black. The high 
identical C-temninal domains of 104 amino acid residues are underiined. 

Figure 7 is a schematic diagram of a Drosopliila germarium. The 
schematized germarium Is oriented with the apical end toward the left. The 
numbers below the genmarium In the brackets denote distind regions within the 

30 germarium. Gemnarial region 1 extends from the apical terminal filament cells 
(TFC, including cap cells, CpC) to the zone of mitotlcally active cysts. 
Germarial regions 2a and 2b include differentiating 16-cell cysts. Two somatic 
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stem cells (SSC) are located at the border between regions 2a and 2b. 
Qemiarial region 3 contains a newly fomied egg chamber that is preparing to 
bud off from the germarium and contribute to the developing string of egg 
chambers which comprise the rest of the ovariole. Abbreviations: GSC, 
5 germline stem cell; Sp, spectrosome; ISC, innersheath cell; Cb, cystobiast; Fu, 
fusome; MC, mitoticaliy active cysts; DC, differentiating 16-cell cysts; FC, 
follicle cell. 

Figure 8 is a schematic diagram of the myc-PIWI and PlWI-myc 
constructs. The myc-piwi transgene encodes a 97.5 kDa protein. In the myc- 
10 PIWI construct, the myc epitope is inserted after the third amino acid residue 
of PIWI. In the PlWI-myc construct, the epitope is inserted 13 amino acids 
downstream of the highly conserved PIWI box (hatched), 48 residues away 
from the C-terminus. Abbreviations: NH2, the amino terminus; COOH, the 
carboxy terminus. 

1 5 Figure 9 is a graph displaying the relative division rates of wild-type and 

p/wf" GSCs and demonstrating that PIWI acts cell-autonomously in GSCs to 
promote their division. The number of germane counted is given in 
parentheses. 

Figure 10 is a graph displaying a comparison of the number of 
20 spectrosomes present per wild-type gennarium versus perp/w^overexpressed 
germarium and demonstrating that somatic PIWI over-expression increases the 
number of germline stem cells. The number of gemnaria examined (n) Is in 
parentheses. 

Detailed Desc ription of the Invention 
25 The present invention discloses a novel class of genes required for the 

growth, proliferation and self-renewing division of stem cells, and proliferation 
of primordial gemi cells. The present invention relates to the discovery of a 
novel class of evoiutionarily conserved genes, the piwi family genes, that 
encode a novel group of basic proteins important for stem cell division, 
30 primordial germ ceil proliferation and other developmental processes. 
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The disclosure of the present invention demonstrates that PIWI protein 
can localize either in the nucleoplasm or cytoplasm and that the dose of the 
PIWI protein detennines both the number and the mitotic rate of germiine stem 
cells, and that the PIWI protein is localized in the gennplasm (the cytoplasm of 
5 future gemi ceils) and plays a key role in determining the initial number of germ 
cells in during embryogenesis. The mouse PIWI protein (i.e., MIWI protein) is 
also a cytoplasmic factor with very similar functions in germ cell and stem cell 
detemnination. 

To identify genes essential for stem cell division, germiine stem cells in 
1 0 the Dmsophilia ovary were established as a model by directly identifying these 
cells and demonstrating their self-renewing asymmetric division pattem. This 
stem cell model allowed applicant to effectively circumvent difficulties intrinsic 
to studying mammalian stem cells, such as their rarity, lack of unique 
morphological mail^ers, hyper-sensitivity to in vitro manipulation, and 
1 5 inaccessibility to systematic genetic screen. Systematic genetic screens were 
then conducted to identify mutations that either abolish the self-renewing ability 
of stem cells orthe asymmetry of stem cell division, or both. These screens led 
to the identification of the Drosophilia piwi gene that is essential for gemrillne 
stem cell self-renewal. Subsequent cloning and molecular analysis of the piwi 
20 gene and its homologs In C. elegans, the mouse, and human, and the 
identification of p/kvAlike genes in >An96/dops/s led to the following novel apsects 
of the present invention. 

DQfipttjQng an^l TgchniguQS Affecting 
Gene Products and Genes 
25 As used in the following detailed description and in the claims, the tenm 

"piwi family" refers to a family or group of genes and gene producte including, 
but not limited to, PIWI, HIWI. MIWI. PRG-1 and PRG-2 proteins, and the piwi, 
hiwi, miwi, prg-1 and prg-2 genes, each of which are further defined herein. 
The tenri "p/w/ family" also includes other members of the piwi family of genes 
30 and gene products characterized by biological activity, including but not limited 
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to the biological activities of modulating growth, proliferation and/or self- 
renewing division of stem cells, and/or proliferation of primordial gemi cells. 

Preferably, p/vw family genes and gene products are isolated from 
eukaryotic sources. Thus, the term "p/w; family" also includes invertebrate 
5 homologs. The term "p/vw family" further Includes vertebrate homologs of piwi 
family members, including, but not limited to, mammalian, avian and fish 
homologs. Prefen-ed mammalian homologs of piwi family members Include, 
but are not limited to, murine and human homologs. 

The terms "p/w/ family gene product", "piwi family protein" and "piwi 

10 family polypeptide" refer to proteins having amino acid sequences which are 
substantially identical to the native amino acid sequences in the p/vw family and 
which are biologically active in that they are capable of modulating growth, 
proliferation and/or self-renewing division of stem cells, and/or proliferation of 
primordial germ cells, or cross-reacting with anti-p/Vw family antibodies raised 

1 5 against a piwi family polypeptide. 

The terms "p/w/ family gene product", "piwi family protein" and "piwi 
family polypeptide" also Include analogs of p/w/ family molecules which exhibit 
at least some biological activity in common with native piwi family gene 
products. Furthermore, those skilled in the art of mutagenesis will appreciate 

20 that other analogs, as yet undisclosed or undiscovered, may be used to 
constmct piwi family analogs. There is no need for an "p/vw family gene 
product", "p/w/ family protein" or "p/iv/ family polypeptide" to comprise all, or 
substantially all of the amino acid sequence of a native piwi family gene 
product. Shorter or longer sequences are anticipated to be of use in the 

25 invention. Thus, the tenrn "p/vv/ family gene producf also includes fusion or 
recombinant p/iv/family polypeptides and proteins. Methods of preparing such 
proteins are described herein in the Examples, among other places. 

The temis "piwi family gene", "piwi family gene sequence" and "piwi 
family gene segment" refer to any DNA sequence that is substantially identical 

30 to a polynucleotide sequence encoding a piwi family gene product, piwi family 
protein or piwi family polypeptide as defined above. The temis also refer to 
RNA, or antisense sequences, compatible with such DNA sequences. A "piwi 
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family gene**, "p/Vw family gene sequence" and "p/w/ family gene segmenf may 
also comprise any combination of associated control sequences. 

The temn "substantially identicar, when used to define either a piwi 
family gene product or piW family amino acid sequence, or a p/iv/ family gene 
5 or piwi family nucleic acid sequence, means that a particular sequence, Ibr 
example, a mutant sequence, varies from the sequence of a natural p/m'family 
by one or more deletions, substitutions, or additions, the net effect of which is 
to retain at least some of biological activity of p/Mr/ family. Altematively, DNA 
analog sequences are "substantially identical" to specific DNA sequences 

10 disclosed herein if: (a) the DNA analog sequence is derived from coding 
regions of the natural piwi family gene; or (b) the DNA analog sequence is 
capable of hybridization of DNA sequences of (a) under moderately stringent 
conditions and which encode biologically active p/iv/ family gene product; or (c) 
the DNA sequences are degenerative as a result of the genetic code to the 

1 5 DNA analog sequences defined in (a) and/or (b). Substantially identical analog 
proteins will be greater than about 60% identical to the corresponding 
sequence of the native protein. Sequences having lesser degrees of similarity 
but comparable biological activity are considered to be equivalents. In 
determining nucleic acid sequences, all subject nucleic acid sequences 

20 capable of encoding substantially similar amino acid sequences are considered 
to be substantially similar to a reference nucleic acid sequence, regardless of 
differences in codon sequences. 

Percent Similarity 
Percent similarity may be detentiined, for example, by comparing 
25 sequence informatbn using the GAP computer program, available fifom the 
University of Wisconsin Geneticist Computer Group. The GAP program utilizes 
the alignment method of Needleman et al. 1970, as revised by Smith et al. 
1981. Briefly, the GAP program defines similarity as the number of aligned 
symbols (i.e. nucleotides or amino acids) which are similar, divided by the total 
30 number of symbols in the shorter of the two sequences. The preferred default 
parameters for the GAP program include: (1) a unitary comparison matrix 
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(containing a value of 1 for identities and 0 for non-identities) of nucleotides 
and the weighted comparison matrix of Gribskov et al., 1 986, as described by 
Schwartz et al., 1979; (2) a penalty of 3.0 for each gap and an additional 0.01 
penalty for each symbol and each gap; and (3) no penalty for end gaps. Other 
5 comparison techniques are described in the Examples. 

The term "homology" describes a mathematically based comparison of 
sequence similarities which is used to identify genes or proteins with similar 
functions or motife. Accordingly, the terni "homology" is synonymous with the 
tenn "similarity" and "percent similarity" as defined above. Thus, the phrases 
10 "substantial homology" or "substantial similarity" have similar meanings. 

Nucleic Acid Sequences 
In certain embodiments, the invention concerns the use of p/w/ family 
genes and gene products that include within their respective sequences a 
sequence which is essentially that of a p/Vw family gene, or the corresponding 

1 5 protein. The terni "a sequence essentially as that of a p/W/ family gene", means 
that the sequence substantially con^esponds to a portion of a piwi family 
polypeptide or piwi family gene and has relatively few bases or amino acids 
(whether DNA or protein) which are not identical to those of a piwi family 
protein or piwi family gene, (or a biologically functional equivalent of. when 

20 refening to proteins). The temn "biologically functional equivalent" is well 
understood in the art and is further defined In detail herein. Accordingly, 
sequences which have between about 70% and about 80%; or more 
preferably, between about 81% and about 90%; or even more preferably, 
between about 91% and about 99%; of amino acids which are identical or 

25 functionally equivalent to the amino acids of a piwi family protein or piwi family 
gene, will be sequences which are "essentially the same". 

p/iv/ family gene products and p/w/ family genes which have functionally 
equivalent codons are also covered by the invention. The term "functionally 
equivalent codon" is used herein to refer to codons that encode the same 

30 amino acid, such as the six codons for arglnine or serine, and also to refer to 
codons that encode biologically equivalent amino acids (see Table 1). Thus, 
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when referring to the sequence examples presented in SEQ ID N0s:1-6, the 
substitution of functionally equivalent codons of Table 1 into the sequence 
examples of SEQ ID N0s:1--6 is envisioned. Thus, applicants are in 
possession of amino acid and nucleic acids sequences which include such 
5 substitutions but which are not set forth herein in their entirety for convenience. 
TABLE 1 - FuncHonallv Equivalent Codons, 



Amino Acids Codons 





Alanine 


Ala 


A 


GCA GCC GCG GCU 




Cysteine 


Cys 


C 


UGC UGU 


10 






n 


GAP GAW 




Giumatic acid 


Glu 


E 


GAAGAG 




Phenylalanine 


Phe 


F 


UUC UUU 




Glycine 


Gly 


G 


GGAGGCGGGGGU 




Histidine 


His 


H 


CAC CAU 


15 


Isoleucine 


lie 


1 


AUAAUCAUU 




Lysine 


Lys 


K 


AAA AAG 




Leucine 


Leu 


L 


UUA UUG CUA cue CUG CUU 




Methionine 


Met 


M 


AUG 




Asparagine 


Asn 


N 


AACAAU 


20 


Proline 


Pro 


P 


CCACCCCCGCCU 




Glutamine 


Gin 


Q 


CAACAG 




Arginine 


Arg 


R 


AGA AGG CGA CGC CGG CGU 




Serine 


Ser 


S 


ACG AGU UCA UCC UCG UCU 




Threonine 


Thr 


T 


ACAACCACGACU 


25 


Valine 


Val 


V 


GUAGUC GUG GUU 




Tryptophan 


Trp 


W 


UGG 




Tyrosine 


Tyr 


Y 


UAC UAU 



It will also be understood that amino acid and nucleic acid sequences 



30 may include additional residues, such as additional N- or C-terminal amino 
acids or 5' or 3' sequences, and yet still be essentially as set forth in one of the 
sequences disclosed herein, so long as the sequence meets the criteria set 
forth above, including the maintenance of biological protein activity where 
protein expression is concemed. The addition of terminal sequences 

35 particularly applies to nucleic acid sequences which may, for example, include 
various non-coding sequences flanking either of the 5' or 3' portions of the 
coding region or may include various intemal sequences, i.e., introns, which 
are known to occur within genes. 

The present invention also encompasses the use of DNA segments 

40 which are complementary, or essentially complementary, to the sequences set 
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forth in the specification. Nucleic acid sequences which are ''complementary" 
are those which are base-pairing according to the standard Watson-Crick 
complementarity rules. As used herein, the term "complementary sequences" 
means nucleic acid sequences which are substantially complementary, as may 
5 be assessed by the same nucleotide comparison set forth above, or as defined 
as being capable of hybridizing to the nucleic acid segment in question under 
relatively stringent conditions such as those described herein. A particular 
example of a contemplated complementary nucleic acid segment is an 
antisense oligonucleotide. 

10 Nucleic acid hybridization will be affected by such conditions as salt 

concentration, temperature, or organic solvents, in addition to the base 
composition, length of the complementary strands, and the number of 
nucleotide base mismatches between the hybridizing nucleic acids, as will be 
readily appreciated by those skilled in the art. Stringent temperature conditions 

15 will generally include temperatures in excess of 30 °C, typically in excess of 
SZ^'C, and preferably in excess of 45^C. Stringent salt conditions will ordinarily 
be less than 1,000 mM, typically less than 500 mlVI, and preferably less than 
200 mM. However, the combination of parameters is much more important 
than tiie measure of any single parameter. (See, e.g., Wetmur & Davidson, 

20 1968). Determining appropriate hybridization conditions to identify and/or 
isolate sequences containing high levels of homology is well known in the art. 
(See e.g. , Sambrook et ai, 1 989). For the purposes of specifying conditions 
of high stringency, prefened conditions are salt concentration of about 200 mM 
and temperature of about 45''C. 

25 Probe sequences may also hybridize specifically to duplex DNA under 

certain conditions to form triplex or other higher order DNA complexes. The 
preparation of such probes and suitable hybridization conditions are well known 
in the art. 

As used herein, the term "DNA segmenf refers to a DNA molecule 
30 which has been isolated free of total genomic DNA of a particular species. 
Furttiemiore, a DNA segment encoding a p/vw femily polypeptide refers to a 
DNA segment which contains piwi family coding sequences, yet is isolated 
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away from, or purified free from, total genomic DNA of a source species, such 
as Homo sapiens. Included within the temn "DNA segment" are DNA segments 
and smaller fragments of such segments, and also recombinant vectors, 
including, for example, plasmids, cosmids, phages, viruses, and the liice. 
5 Similarly, a DNA segment comprising an isolated or purified p/m family 

gene refers to a DNA segment including piwi famWy coding sequences isolated 
substantially away ftom other naturally occuning genes or protein encoding 
sequences. In this respect, the term "gene"* is used for simplicity to refer to a 
functional protein, polypeptide or peptide encoding unit. As will be understood 

10 by those in the art, this functional term includes both genomic sequences and 
cDNA sequences. "Isolated substantially away from other coding sequences" 
means that the gene of interest, in this case, the piwi family gene, forms the 
significant part of the coding region of the DNA segment, and that the DNA 
segment does not contain large portions of naturally-occurring coding DNA, 

15 such as large chromosomal fragments or other functional genes or cDNA 
coding regions. Of course, this refers to the DNA segment as originally 
isolated, and does not exclude genes or coding regions later added to the 
segment by the hand of man. 

In particular embodiments, the invention concerns isolated DNA 

20 segments and recombinant vectors incorporating DNA sequences which 
encode a p/iv; family polypeptide that includes within its amino acid sequence 
an amino acid sequence selected from any of SEQ ID N0s:2, 4 and 6. In other 
particular embodiments, the invention concerns isolated DNA segments and 
recombinant vectors incorporating DNA sequences which encode a protein that 

25 includes within its amino add sequence the amino acid sequence of a piwi 
family polypeptide conresponding to human tissues (refen^ed to herein as HIWI). 

It will also be understood that this invention is not limited to the particular 
nucleic acid and amino acid sequences of any of SEQ ID N0s:1-6. 
Recombinant vectors and isolated DNA segments may therefore variously 

30 include the piwi family polypeptide-encoding region itself, include coding 
regions bearing selected alterations or modifications in the basic coding region, 
or include encoded larger polypeptides which nevertheless include p/vw family 
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polypeptide-encoding regions or may encode biologically functional equivalent 
proteins or peptides which have variant amino acid sequences. 

In certain embodiments, the invention concerns isolated DNA segments 
and recombinant vectors which encode a protein or peptide that includes within 
5 its amino acid sequence an amino acid sequence essentially as set forth in any 
of SEQ ID NOs:2, 4 and 6. Naturally, where the DNA segment or vector 
encodes a lull length p/w/ family gene product, the most prefenred nucleic acid 
sequence is tiiat which is essentially as set forth in any of SEQ ID N0s:1 , 3 
and 5 and which encode a protein that exhibits activity in the modulation of 

10 growth, proliferation and self-renewing division of stem cells (in for example 
gemiline stem cells), and proliferation of primordial germ cells as may be 
determined by, for example, immunofluorescence assays, as disclosed herein. 

The term "a sequence essentially as set forth in any of SEQ ID N0s:2, 
4 and 6" means that the sequence substantially corresponds to a portion an 

1 5 amino acid sequence of any of SEQ ID NOs:2, 4 and 6 and has relatively few 
amino acids which are not identical to, or a biologically functional equivalent of, 
the amino acids of an amino acid sequence of any of SEQ ID NOs:2, 4 and 6. 
The term "biologically functional equivalent" is well understood in the art and 
is further defined in detail herein. Acconjingly, sequences, which have 

20 between about 70% and atx)ut 80%; or more preferably, between about 81 % 
and about 90%; or even more preferably, between about 91 % and about 99%; 
of amino acids which are identical orfunctionally equivalent to the amino acids 
of any of SEQ ID NOs:2, 4 and 6, will be sequences which "a sequence 
essentially as set forth in any of SEQ ID N0s:2, 4 and 6". 

25 In particular embodiments, the invention concerns gene therapy 

methods that use isolated DNA segments and recombinant vectors 
incorporating DNA sequences which encode a protein that includes within its 
amino acid sequence an amino acid sequence of any of SEQ ID N0s:2, 4 and 
6,any of SEQ ID NOs:2, 4 and 6 including sequences which are derived from 

30 mammalian tissue, among other tissue. In other particular embodiments, the 
invention concerns isolated DNA sequences and recombinant DNA vectors 
incorporating DNA sequences which encode a protein that includes within its 
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amino acid sequence the amino acid sequence of tlie p/vw family protein from 
human tissue (l-IIWI). 

in certain other embodiments, the invention concerns isolated DNA 
segments and recombinant vectors that include within their sequence a nucleic 
5 acid sequence essentially as set forth in any of SEQ ID NOs:1 , 3 and 5. The 
temn "a sequence essentially as set forth in any of SEQ ID N0$:1 , 3 and 5" are 
used in the same sense as described above and means that the nucleic acid 
sequence substantially conBsponds to a portion of any of SEQ ID NOs: 1 , 3 and 
5, respectively, and has relatively few codons which are not identical, or 

10 functionally equivalent, to the codons of any of SEQ ID N0s:1, 3 and 5, 
respectively. Again, DNA segments which encode gene products exhibiting 
activity in the modulation of growth, proliferation and self-renewing division of 
stem cells, and proliferation of primordial genu cells or other biological activity 
of the p/w/ family gene product will be most preferred. The term "functionally 

15 equivalent codon" is used herein to refer to codons that encode the same 
amino acid, such as the six codons for arginine or serine, and also to refer to 
codons that encode biologically equivalent amino acids (see Table 1). 

The nucleic acid segments of the present invention, regardless of the 
length of the coding sequence itself, may be combined with other DNA 

20 sequences, such as promoters, enhancers, polyadenylation signals, additional 
restriction enzyme sites, multiple cloning sites, other coding segments, and the 
like, such that their overall length may vary considerably. It is therefore 
contemplated that a nucleic acid fragment of almost any length may be 
employed, with the total length preferably being limited by the ease of 

25 preparation and use in the intended recombinant DNA protocol. For example, 
nucleic acid firagments may be prepared which include a short stretch 
complementary to a nucleic acid sequence set for in any of SEQ ID NOs:1 , 3 
and 5 respectively, such as about 10 nucleotides, and which are up to 10,000 
or 5,000 base pairs in length, with segments of 3,000 being prefenred in certain 

30 cases. DNA segments with total lengths of about 1 ,000, 500, 200, 100 and 
about 50 base pairs in length are also contemplated to be useful. 
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The DNA segments of the present invention encompass biologically 
functional equivalent p/w/ family proteins and peptides. Such sequences may 
rise as a consequence of codon redundancy and functional equivalency which 
are known to occur naturally within nucleic acid sequences and the proteins 
5 thus encoded. 

Alternatively, functionally equivalent proteins or peptides may be created 
via the application of recombinant DNA technology, in which changes in the 
protein structure may be engineered, based on considerations of the properties 
of the amino acids being exchanged, e.g. substitution of He for Leu at amino 

1 0 acid 1 3, at amino acid 1 06 and/or at amino acid 785 for PiWI in SEQ ID NOs:1- 
2; substitution of He for Leu at amino acid 90, at amino acid 383 and/or at 
amino acid 816 for l\^IWI In SEQ ID NOs:3-4; and substitution of lie for Leu at 
amino acid 76, at amino acid 303 and/or at amino acid 735 for IHIWI in SEQ ID 
NOs:5-6. Changes designed by man may be introduced through the 

15 application of site-directed mutagenesis techniques, e.g., to introduce 
improvements to the antigenicity of the protein or to test p/w/ family mutants in 
order to examine activity in the modulation of growth, proliferation and self- 
renewing division of stem cells, and proliferation of primordial germ cells, or 
other activity at the molecular level. 

20 If desired, one may also prepare fusion proteins and peptides, e.g., 

where the p/w/ family coding region is aligned within the same expression unit 
with other proteins or peptides having desired functions, such as for purification 
or immunodetection purposes (e.g., proteins which may be purified by afHnity 
chromatography and enzyme label coding regions, respectively). 

25 Recombinant vectors form important further aspects of the present 

invention. Particulariy useful vectors are contemplated to be those vectors in 
which the coding portion of the DNA segment is positioned under the control 
of a promoter. The promoter may be in the form of the promoter which is 
naturally associated with the p/w/ family gene, e.g., in mammalian tissues, as 

30 may be obtained by isolating the 5' non-coding sequences located upstream 
of the coding segment or exon, for example, using recombinant cloning and/or 
PGR technology, in connection with the compositions disclosed herein. 
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In other embodiments, it is contemplated that certain advantages will be 
gained by positioning the coding DNA segment under the control of a 
recombinant, or heterologous, promoter. As used herein, a recombinant or 
heterologous promoter is intended to refer to a promoter that is not nomnaliy 
5 associated with a pnv/ family gene in its natural environment. Such promoters 
may include promoters isolated from bacterial, viral, eukaryotic, fish, avian or 
mammalian cells. Naturally, it will be important to employ a promoter that 
effectively directs the expression of the DNA segment in the ceil type chosen 
for expression. The use of promoter and cell type combinations for protein 

10 expression Is generally known to those of skill in the art of molecular biology, 
for example, see Sambrook et al., 1989, specifically incorporated herein by 
reference. The promoters employed may be constitutive, or inducible, and can 
be used under the appropriate conditions to direct high level expression of the 
introduced DNA segment, such as is advantageous in the large-scale 

15 production of recombinant proteins or peptides. Appropriate promoter systems 
contemplated for use in high-level expression include, but are not limited to, the 
vaccina virus promoter and the baculovims promoter. 

In an alternative embodiment, the present invention provides an 
expression vector comprising a polynucleotide that encodes a piwi family 

20 polypeptide having activity in the modulation of growth, proliferation and self- 
renewing division of stem cells (including germline and somatic stem cells), cell 
proliferation (including activity in embryonic development as described in 
Example 1 ), proliferation of primordial gemn cells, or other biological activity in 
accordance with the present invention. Also preferably, an expression vector 

25 of the present invention comprises a polynucleotide that encodes human piwi 
family gene product (HIWI). More preferably, an expression vector of the 
present invention comprises a polynucleotide that encodes a polypeptide 
comprising an amino acid residue sequence of any of SEQ ID N0s:2, 4 and 6. 
More preferably, an expression vector of the present invention comprises a 

30 polynucleotide comprising the nucleotide base sequence of any of SEQ ID 
N0s:1 , 3 and 5. Even more preferably, an expression vector of the invention 
comprises a polynucleotide operativeiy linked to an enhancer-promoter. More 
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preferably still, an expression vector of the invention comprises a 
polynucleotide operatively linked to a prokaryotic promoter. Alternatively, an 
expression vector of the present invention comprises a polynucleotide 
operatively linked to an enhancer-promoter that is a eukaryotic promoter, and 
5 the expression vector further comprises a polyadenylation signal that is 
positioned 3' of the cartx)xy4enminal amino acid and within a transcriptional unit 
of the encoded polypeptide. 

In yet another embodiment, the present invention provides a 
recombinant host cell transfected with a polynucleotide that encodes a p/wi 

1 0 family polypeptide having activity in the modulation of growth, proliferation and 
self-renewing division of stem cells (including germline and somatic stem cells), 
cell proliferation (Including activity in embryonic development as described in 
Example 1), proliferation of primordial gemi cells, or other biological activity in 
accordance with the present invention. SEQ ID NOs:3-6 set forth nucleotide 

15 and amino acid sequences from representative vertebrates, human and 
mouse. Also contemplated by the present invention are homologous or 
biologically equivalent polynucleotides and piwi family polypeptides found in 
other vertebrates. Preferably, a recombinant host cell of the present invention 
is transfected with the polynucleotide that encodes human piwi family 

20 polypeptide. More preferably, a recombinant host cell of the present invention 
is transfected with the polynucleotide sequence of SEQ ID N0:3 {miwi) or SEQ 
ID NO:5 (hiwi). Even more preferably, a host cell of the invention is a 
eukaryotic host cell. Still more preferably, a recombinant host cell of the 
present invention is a vertebrate ceil. Preferably, a recombinant host cell of the 

25 invention is a mammalian, avian or fish cell. 

In another aspect, a recombinant host cell of the present invention is a 
prokaryotic host cell. Preferably, a recombinant host cell of the invention is a 
bacterial cell, preferably a strain of Escherichia coli. More preferably, a 
recombinant host cell comprises a polynucleotide under the transcriptional 

30 control of regulatory signals lunctional in the recombinant host cell, wherein the 
regulatory signals appropriately control expression of the piwi family 
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polypeptide in a manner to enable all necessary transcriptional and 
post-transcriptional modification. 

In yet another embodiment, the present invention contemplates a 
method of preparing a piwi family polypeptide comprising transfecting a cell 
5 with polynucleotide that encodes a p/m family polypeptide having activity in the 
modulation of growth, self-renewing division of stem cells (including genmline 
and somatic stem cells), cell proliferation (including activity in embryonic 
development as described in the Examples), proliferation of primordial germ 
ceils or other biological activity in accordance with the present invention, to 

10 produce a transformed host cell; and maintaining the transformed host ceil 
under biological conditions sufficient for expression of the polypeptide. More 
preferably, the transfomned host cell is a eukaryotic cell. More preferably still, 
the eukaryotic cell is a vertebrate cell. Alternatively, the host cell is a 
prokaryotic cell. More preferably, the prokaryotic cell is a bacterial cell of 

1 5 Escherichia colL Even more preferably, a polynucleotide transfected into the 
transformed cell comprises the nucleotide base sequence of any of SEQ ID 
NOs:1, 3 and 5. SEQ ID NOs:3-6 set forth nucleotide and amino acid 
sequences for representative vertebrates, mouse and human. Also 
contemplated by the present invention are homologues or biologically 

20 equivalent piwi family polynucleotides and polypeptides found in other 
vertebrates, particulariy warm blooded vertebrates. 

As mentioned above, in connection with expression embodiments to 
prepare recombinant p/tv/ family proteins and peptides, it is contemplated that 
longer DNA segments will most often be used, with DNA segments encoding 

25 the entire piw/family protein, functional domains or cleavage products thereof, 
being most preferred. However, it will be appreciated that the use of shorter 
DNA segments to direct the expression of p/iv/family peptides or epitopic core 
regions, such as may be used to generate anti-p/w/ family antibodies, also falls 
within the scope of the invention. 

30 DNA segments which encode peptide antigens from about 1 5 to about 

50 amino acids in length, or more preferably, from about 1 5 to about 30 amino 
acids in length are contemplated to be particulariy useful. DNA segments 
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encoding peptides will generally have a minimum coding length in the order of 
about 45 to about 150, or to about 90 nucleotides. DNA segments encoding 
ftjll length proteins may have a minimum coding length on the order of about 
4,000 nucleotides for a protein in accordance with any of SEQ ID NOs: 2, 4 and 
5 6. 

Naturally, the present invention also encompasses DNA segments which 
are complementary, or essentially complementary, to the sequences set forth 
in any of SEQ ID NOs:! , 3 and 5. The terms "complementary" and "essentially 
complementary" are defined above. Excepting intronic or flani<ing regions, and 

10 allowing for the degeneracy of the genetic code, sequences which have 
between about 70% and about 80%; or more preferably, between about 81% 
and about 90%; or even more preferably, between about 91 % and about 99%; 
of nucleotides which are identical or functionally equivalent (i.e. encoding the 
same amino acid) of nucleotides of any of SEQ ID N0s:1, 3 and 5, will be 

15 sequences which are '*a sequence essentially as set forth in any of SEQ ID 
NOs:1 , 3 and 5". Sequences which are essentially the same as those set forth 
in any of SEQ ID NOs:1, 3 and 5 may also be functionally defined as 
sequences which are capable of hybridizing to a nucleic acid segment 
containing the complement of any of SEQ ID NOs:1, 3 and 5 under relatively 

20 stringent conditions. Suitable relatively stringent hybridization conditions are 
described herein and will be well known to those of skill in the art. 

Bioloqicallv Functional Equivalents 
As mentioned above, modification and changes may be made in the 
structure of the piwi family proteins and peptides described herein and still 

25 obtain a molecule having like or othenvise desirable characteristics. For 
example, certain amino acids may be substituted for other amino acids in a 
protein structure without appreciable loss of interactive capacity with structures 
such as, for example, in the nucleus of a cell. Since it is the Interactive 
capacity and nature of a protein that defines that protein's biological functional 

30 activity, certain amino acid sequence substitutions can be made in a protein 
sequence (or, of course, its underiying DNA coding sequence) and 
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nevertheless obtain a protein with like or even counten/ailing properties (e.g., 
antagonistic v. agonistic). It is tlius contemplated by the inventor that various 
changes may be made in the sequence of thep/w/family proteins and peptides 
(or underlying DNA) without appreciable loss of their biological utility or activity. 
5 It is also well understood by the skilled artisan that, inherent In the 

definition of a biologically functional equivalent protein or peptide, is the 
concept that there is a limit to the number of changes that may be made within 
a defined portion of the molecule and still result in a molecule with an 
acceptable level of equivalent biological activity. Biologically functional 

1 0 equivalent peptides are thus defined herein as those peptides in which certain, 
not most or all, of the amino acids may be substituted. Of course, a plurality 
of distinct proteins/peptides with different substitutions may easily be made and 
used in accordance with the invention. 

It is also well understood that where certain residues are shown to be 

15 particularly important to the biological or structural properties of a protein or 
peptide, e.g., residues in active sites, such residues may not generally be 
exchanged. This is the case in the present invention, where if any changes, 
for example, in the PIWI box of p/w/ family polypeptides, could result in a loss 
of an aspect of the utility of the resulting peptide for the present invention. 

20 Amino acid substitutions, such as those which might be employed in 

modifying the p/w/ family proteins and peptides described herein, are generally 
based on the relative similarity of the amino acid side-chain substituents, for 
example, their hydrophobidty, hydrophilicity, charge, size, and the like. An 
analysis of the size, shape and type of the amino acid side-chain substituents 

25 reveals that arginine, lysine and histidine are all positively charged residues; 
that alanine, glycine and serine are all a similar size; and that phenylalanine, 
tryptophan and tyrosine all have a generally similar shape. Therefore, based 
upon these considerations, arginine, lysine and histidine; alanine, glycine and 
serine; and phenylalanine, tryptophan and tyrosine; are defined herein as 

30 biologically functional equivalents. 

In making such changes, the hydropathic index of amino acids may be 
considered. Each amino acid has been assigned a hydropathic index on the 



wo 00/32039 



PCTAJS99/28764 



-27- 

basis of their hydrophobicity and charge characteristics, these are: isoleucine 
(+ 4.5); valine (+ 4.2); leucine (+ 3.8); phenylalanine (+ 2.8); cysteine/cystine 
(+ 2.5); methionine (+ 1.9); alanine (+ 1.8); glycine (-0.4); threonine (-0.7); 
serine (-0,8); tryptophan (-0.9); tyrosine (-1.3); proline (-1,6); histidine (-^.2); 
5 glutamate (-3.5); glutamine (-3.5); aspartate (-3.5); asparagine (-3.5); lysine (- 
3.9); and arginine (-4.5). 

The importance of the hydropathic amino acid Index in confening 
interactive biological function on a protein is generally understood in the art 
(Kyte & Doolittle, 1982, incorporated herein by reference). It is known that 

1 0 certain amino acids may be substituted Ibr other amino acids having a similar 
hydropathic index or score and still retain a similar biological activity. In making 
changes based upon the hydropathic index, the substitution of amino acids 
whose hydropathic indices are within ±2 of the original value is preferred, those 
which are within ±1 of the original value are particulariy preferred, and those 

15 within ±0.5 of the original value are even more particularly preferred. 

It is also understood in the art that the substitution of like amino acids 
can be made effectively on the basis of hydrophilicity. U.S. Pat No, 4,554,1 01 , 
incorporated herein by reference, states that the greatest local average 
hydrophilicity of a protein, as govemed by the hydrophilicity of its adjacent 

20 amino acids, correlates with its immunogenicity and antigenicity, i.e. with a 
biological property of the protein. It is understood that an amino acid can be 
substituted for another having a similar hydrophilicity value and still obtain a 
biologically equivalent protein. 

As detailed in U.S. Pat No. 4,554,101, the following hydrophilicity 

25 values have been assigned to amino acid residues: arginine (+ 3.0); lysine (+ 
3.0); aspartate (+ 3.0±1); glutamate (+ 3.0±1); serine (+ 0.3); asparegine (+ 
0.2); glutamine (+ 0.2); glycine (0); threonine (-0.4); proline (-0.5±1 ); alanine (- 
0,5); histidine (-0.5); cysteine (-1.0); methionine (-1.3); valine (-1,5); leucine (- 
1.8); isoleucine (-1.8); tyrosine (-2.3); phenylalanine (-2.5); tryptophan (-3.4) . 

30 In making changes based upon similar hydrophilicity values, the 

substitution of amino acids whose hydrophilicity values are within ±2 of the 
original value is preferred, those which are within ±1 of the original value are 
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particulariy preferred, and those within ±0.5 of the original value are even more 
particularly preferred. 

Whiie discussion has focused on functionally equivalent poiypeptides 
arising from amino acid changes. It will be appreciated that these changes may 
5 be effected by alteration of the encoding DNA, taking Into consideration also 
that the genetic code is degenerate and that two or more codons may code for 
the same amino acid. 

Sequence Modification Techniques 
Modifications to the p/iv/ family proteins and peptides described herein 

1 0 may be canried out using techniques such as site directed mutagenesis. Site- 
specific mutagenesis is a technique useful in the preparation of individual 
peptides, or biologically functional equivalent proteins or peptides, through 
specific mutagenesis of the underlying DNA. The technique further provides 
a ready ability to prepare and test sequence variants, for example, 

1 5 incorporating one or more of the foregoing considerations, by introducing one 
or more nucleotide sequence changes Into the DNA. Site-specific mutagenesis 
allows the production of mutants through the use of specific oligonucleotide 
sequences which encode the DNA sequence of the desired mutation, as well 
as a sufficient number of adjacent nucleotides, to provide a primer sequence 

20 of sufficient size and sequence complexity to fomri a stable duplex on both 
sides of the deletion junction being traversed. Typically, a primer of about 17 
to 30 nucleotides in length is prefenred, with about 5 to 10 residues on both 
sides of the Junction of the sequence t>eing altered. 

In general, the technique of site-specific mutagenesis is well known In 

25 the art as exemplified by publications (e.g.. Adelman et al., 1983). As will be 
appreciated, the technique typically employs a phage vector which exists In 
both a single stranded and double stranded form. Typical vectors useful In 
site-directed mutagenesis include vectors such as the Ml 3 phage (Messing et 
al., 1981). These phage are readily commercially available and their use Is 

30 generally well known to those skilled in the art. Double stranded plasmids ar^ 



WO00A32Q39 



PCT/US99>28764 



-29- 

also routinely employed in site directed mutagenesis which eliminates the step 
of transferring the gene of interest from a plasmid to a phage. 

In general, site-directed mutagenesis in accordance herewith is 
performed by first obtaining a single-stranded vector or melting apart the two 
5 strands of a double stranded vector which includes within its sequence a DNA 
sequence which encodes, for example, a human p/iv/ family polypeptide (HIWI). 
An oligonucleotide primer bearing the desired mutated sequence is prepared, 
generally synthetically, for example by the method of Crea et al. (1 978). This 
primer is then annealed with the single-stranded vector, and subjected to DNA 

1 0 polymerizing enzymes such as E. coli polymerase I Klenow fragment, in order 
to complete the synthesis of the mutation-bearing strand. Thus, a heteroduplex 
Is formed wherein one strand encodes the original non-mutated sequence and 
the second strand bears the desired mutation. This heteroduplex vector is then 
used to transform appropriate cells, such as E. coli cells, and clones are 

15 selected which include recombinant vectors bearing the mutated sequence 
arrangement. 

The preparation of sequence variants of the selected gene using site- 
directed mutagenesis is provided as a means of producing potentially useful 
piwi family polypeptide or other species capable of modulating growth, 

20 proliferation and self-renewing division of stem cells, and proliferation of 
primordial genm cells and is not meant to be limiting as there are other ways in 
which sequence variants of these peptides may be obtained. For example, 
recombinant vectors encoding the desired genes may be treated with 
mutagenic agents to obtain sequence variants (see, e.g., a method described 

25 by Eichenlaub, 1979) for the mutagenesis of plasmid DNA using 
hydroxylamine. 

Other Stmctural Equivalents 
In addition to the p/iv/ family peptidyl compounds described herein, the 
inventors also contemplate that other sterically similar compounds may be 
30 fomiulated to mimic the key portions of the peptide stmcture. Such compounds 
may be used in the same manner as the peptides of the invention and hence 
are also functional equivalents. The generation of a structural functional 
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equivalent may be achieved by the techniques of modeling and chemical 
design known to those of skill in the art. It will be understood that all such 
stericaliy similar constructs fall within the scope of the present invention. 

Iptrodugtion of Gene Products 
5 Where the gene itself is employed to introduce the gene products, a 

convenient method of introduction will be through the use of a recombinant 
vector which incorporates the desired gene, together with its associated control 
sequences. The preparation of recombinant vectors is well known to those of 
skill in the art and described in many references, such as, for example, 

10 Sambrook et al. (1989), specifically incorporated herein by reference. 

In vectors, it is understood that the DNA coding sequences to be 
expressed, in this case those encoding the piwi family gene products, are 
positioned adjacent to and under the control of a promoter. It is understood in 
the art that to bring a coding sequence under the control of such a pnDmoter, 

15 one generally positions the 5' end of the transcription initiation site of the 
transcriptional reading frame of the gene product to be expressed between 
about 1 and about 50 nucleotides "downstream" of (i.e., 3' of) the chosen 
promoter. One may also desire to incorporate into the transcriptional unit of the 
vector an appropriate polyadenylation site (e.g., 5'-AATAAA-3'), if one was not 

20 contained within the original inserted DNA. Typically, these poly A addition sites 
are placed about 30 to 2000 nucleotides ''downstream" of the coding sequence 
at a position prior to transcription temnination. 

While use of the control sequences of the specific gene (i.e., a piwi 
family promoter for a pAv/ family gene) will be prefenred, there is no reason why 

25 other control sequences could not be employed, so long as they are compatible 
with the genotype of the cell being treated. Thus, one may mention other 
useful promoters byway of example, including, e.g., an SV40 early promoter, 
a long terminal repeat promoter from rotrovinjs, an actin promoter, a heat 
shock promoter, a metallothionein promoter, and the like. 

30 As is known in the art, a promoter is a roglon of a DNA molecule 

typically within about 1 00 nucleotide pairs in front of (upstream of) the point at 
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Which transcription begins (i.e., a transcription start site). That region typically 
contains several types of DNA sequence elements that are located in similar 
relative positions in different genes. As used herein, the term "promoter^ 
includes what is referred to in the art as an upstream promoter region, a 
5 promoter region or a promoter of a generalized eukaryotic RNA Polymerase II 
transcription unit. 

Another type of discrete transcription regulatory sequence element is an 
enhancer. An enhancer provides specificity of time, location and expression 
level for a particular encoding region (e.g., gene). A major function of an 

1 0 enhancer is to increase the level of transcription of a coding sequence in a cell 
that contains one or more transcription factors that bind to that enhancer. 
Unlike a promoter, an enhancer can function when located at variable 
distances from transcription start sites so long as a promoter is present. 

As used herein, the phrase "enhancer-promoter" means a composite 

15 unit that contains both enhancer and promoter elements. An 
enhancer-promoter is operatively linked to a coding sequence that encodes at 
least one gene product. As used herein, the phrase "operatively linked" means 
that an enhancer-promoter is connected to a coding sequence in such a way 
that the transcription of that coding sequence is controlled and regulated by 

20 that enhancer-promoter Means for operatively linking an enhancer-promoter 
to a coding sequence are well known in the art. As is also well known in the 
art, the precise orientation and location relative to a coding sequence whose 
transcription is controlled, is dependent inter alia upon the specific nature of the 
enhancer-promoter. Thus, a TATA box minimal promoter is typically located 

25 from about 25 to about 30 base pairs upstream of a transcription initiation site 
and an upstream promoter element is typically located from about 1 00 to about 
200 base pairs upstream of a transcription initiation site. In contrast, an 
enhancer can be located downstream from the initiation site and can be at a 
considerable distance from that site. 

30 An enhancer-promoter used in a vector construct of the present 

invention can be any enhancer-promoter that drives expression in a cell to be 
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transfected. By employing an enhancer-promoter with well-known properties, 
the level and pattern of gene product expression can be optimized. 

For introduction of, for example, the human p/w/ family gene {hiwi\ It is 
proposed that one will desire to preferably employ a vector construct that will 
5 deliver the desired gene to the affected cells. This will, of course, generally 
require that the constmct be delivered to the targeted cells, for example, 
mammalian, avian or fish gemiline stem cells. It is proposed that this may be 
achieved most preferably by introduction of the desired gene through the use 
of a viral vector to cany the p/w/ family sequence to efficiently infect the cells, 

10 These vectors will preferably be an adenoviral, a retroviral, a vaccinia viral 
vector or adeno-associated virus. These vectors are preferred because they 
have been successfully used to deliver desired sequences to cells and tend to 
have a high infection efficiency. Suitable vector-p/iv/family gene constructs are 
adapted for administration as pharmaceutical compositions, as described 

15 herein below. 

Commonly used viral promoters for expression vectors are derived from 
polyoma, cytomegalovirus, Adenovims 2, and Simian Virus 40 (SV40). The 
eariy and late promoters of SV40 virus are particularly useful because both are 
obtained easily from the vims as a fragment which also contains the SV40 viral 

20 origin of replication. Smaller or larger SV40 fragments may also be used, 
provided there is included the approximately 250 bp sequence extending from 
the Hind III site toward the Bgl I site located in the viral origin of replication. 
Further, it is also possible, and often desirable, to utilize promoter or control 
sequences nonnally associated with the desired gene sequence, provided such 

25 control sequences are compatible with the host cell systems. 

The origin of replication may be provided either by constmction of the 
vector to include an exogenous origin, such as may be derived finom SV40 or 
other viral (e.g.. Polyoma, Adeno, VSV. BPV) source, or may be provided by 
the host cell chromosomal replication mechanism. If the vector is integrated 

30 into the host cell chromosome, the latter is often sufficient. 

Where a piwi famWy gene itself is employed it will be most convenient to 
simply use a wild type p/w/ family gene directly. However, it is contemplated 
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that certain regions of a p/Vw family gene may be employed exclusively without 
employing an entire wild type piwi family gene. It is proposed that it will 
ultimately be preferable to employ the smallest region needed to modulate 
stem cell growth, proliferation and self-renewing division, and/or primordial 
5 germ cell proliferation, so that one is not introducing unnecessary DNA into 
cells which receive a p/vw family gene construct. Techniques well known to 
those of skill in the art, such as the use of restriction enzymes, will allow for the 
generation of small regions of an exemplary piwi family gene. The ability of 
these regions to modulate cell signaling can easily be detennined by the 
10 assays reported in the Examples. In general, techniques for assessing the 
modulation of stem cell growth, proliferation and self-renewing division, along 
with primordial germ cell proliferation are known in the art. 

Cell Culture Techniques 
As disclosed herein, PIWI functions both as a cytoplasmic protein 

15 (during embryonic germline development) and as nuclear protein during stem 
cell division. PIWI has a cell-autonomous function to promote stem cell division 
in addition to its signaling function in maintaining stem cells. The dose of PIWI 
gene determines the number of germline stem cells fomried in the embryo. 
In accordance with the present invention, piwi family genes and gene 

20 products can thus be used to expand a primitive cell population in vitro. The 
term "primitive ceir is adopted herein for convenience to collectively refer to 
stem cells, primordial germ cells and to other primitive cells having totipotent 
or pluripotent characteristics, or other characteristics of stem cells and 
primordial genu cells. The temi "stem ceir is meant to have its art-recognized 

25 meaning, and to include, but not be limited to, the following examples of stem 
cells: hematopoietic stem cells, neural stem cells, and gemiline stem cells. The 
temn "primordial germ cell" is also meant to have its art-recognized meaning. 
Such expanded primitive cell populations are utilized for therapeutic or genetic 
engineering purposes in human and mammalian, avian, and fish systems. 
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The term "sustained" as used herein with respect to primitive cells, and 
cultures of the same, refers to a cell or cell culture capable of undergoing 
further cell division, even If the cells are eventually subject to senescence. 
A method of culturing primitive cells is thus envisioned in accordance 
5 with the. present invention. The method comprises providing a culture 
comprising a primitive cell, and delivering to the provided cell a p/iv/famiiy gene 
product, whereby growth of the cell in the culture is accomplished. Preferably, 
the primitive cell is maintained in culture for a time sufficient to produce a 
sustained cell culture. 

1 0 Optionally, the piwi family gene product is delivered to the primitive cell 

by delivering to the primitive cell an effective amount of a DNA molecule 
comprising a polynucleotide that encodes a biologically active piwi family 
polypeptide; and maintaining the primitive cell under conditions sufficient for 
expression of the polypeptide. Alternatively, the piwi family gene product is 

15 delivered to the cell by administration of a biologically active piwi family 
polypeptide to the culture via the culture media or via a feeder matrix included 
in the culture. 

A prefen^ed embodiment of the method comprises: (a) collecting 
primitive cells; (b) depositing the primitive cells in contact with a feeder matrix, 

20 the feeder matrix comprising a piwi family gene product; and (c) growing the 
primitive cells on the feeder matrix in the presence of media for a time sufficient 
to produce a sustained primitive cell, culture. 

Thus, typically, the cultures of the present invention include a feeder 
matrix as defined herein. A feeder matrix can either be cells or cell lines 

25 cultured for the purpose of obtaining a sustained primitive cell culture. In this 
case the cell line can be transformed with a piwi family gene product in 
accordance with techniques disclosed herein so as to produce the p/w/ family 
gene product. Alternatively, a feeder matrix can be derived from or provided 
by the organ or tissue in which the primitive cells are located, e.g., the gonad. 

30 Alternatively, the feeder cells comprising the feeder matrix could be substituted 
with extracellular matrix plus bound growtii factors, and particulariy including 
a piwi family gene product 
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A feeder matrix as used herein is constructed in accordance with 
procedures known in the art. As noted above, it is preferred that the feeder 
matrix be preconditioned. By the term "preconditioned" it is meant that the 
feeder matrix is cultured in the presence of media for a period of time prior to 
5 the depositing of primitive cells in contact with the feeder matrix, e.g. a time 
sufficient to initiate and establish production of, for example, growth factors or 
other factors by the feeder matrix, and particulariy including a p/wi family gene 
product. Thus, a feeder matrix can be preconditioned by culturing the feeder 
matrix by itself for one to two days prior to the depositing of primitive ceils in 

1 0 contact with the feeder matrix. 

Candidate cells for use in the feeder matrix are disclosed in the 
Examples. Additionally, the feeder matrix can comprise Sertoli cells or 
fibroblast cells, Including mouse fibroblast cells. STO fibroblasts and primary 
fibroblasts are suitable examples. Also, it is contemplated that feeder matrices 

15 comprising cells from other murine species (e.g., rat); other mammalian 
species (e.g., ungulate, bovine, and porcine species); or avian species (e.g. 
Gallinacea, chicken, turkey, duck, goose, quail and pheasant) may also be 
used. 

The culturing method of the present invention can optionally comprise 
20 establishing a monolayer of feeder cells. Feeder cells may mitotically 
inactivated using standard techniques. For example, the feeder cells may be 
exposed to gamma radiation (e.g. 4000 Rads of gamma radiation) or may be 
treated with Mitomycin C (e.g. 10 A^g/ml for 2-3 hours). Procedures for 
mitotically Inactivating cells are also detailed in the infomiation typically sent 
25 withcellsfromtheAmericanTypeCultureCollection (ATCC), 10801 University 
Boulevard, Manassas, VA 20110-2209 (e.g. STO feeder cells are available 
under ATCC accession number 1 503). Monolayers may optionally be cultured 
to about 80% confluency, preferably to about 90% confluency, and more 
preferably about 100% confluency. While configuration of the feeder cells as 
30 a nnonolayer is the preferred configuration for the culture, any suitable 
configuration is contemplated to be within the scope of the present invention. 
Thus, for example, layers, monolayers, clusters, aggregates or other 
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associations or groupings of feeder cells are contemplated to fall within the 
scope of the present invention and are particularly contemplated to fall with the 
meaning of the term "matrix". 

Thus, the primitive cells can extend non-uniformly through the feeder 
5 cells veritically, horizontally, diagonally, or in any combination thereof, such that 
cell aggregates are fomned. In accordance with the present invention, it Is thus 
contemplated that the cultures can be fomned by mixing primitive cells with 
appropriate feeder cells and inserting the mixture into a suitable culture 
container, such as a multi-well plate. 

10 The media used in canrying out the present invention may be any 

suitable media. The media may be a conditioned media or a synthetic media, 
both of which are known in the art. Conditioned media, and particulariy BRL 
conditioned media, is cun^ently preferred. Byway of example, BRL conditioned 
media is prepared according to art-recognized techniques, such as described 

15 by Smith, A. G. and Hooper, M. L., Dev. BioL 1987 May; 121(1): 1-9. BRL cells 
are available from ATCC under accession number CRL-1442. The media can 
be supplemented with a p/vw family gene product. The media can also be 
supplemented with growth factors, including but not limited to leukemia 
inhibitory factor (LIF), insulin-like growth factor (IGF), fibroblast growth factor 

20 (FGF), basic fibroblast growth factor (bFGF), stem cell factor (SCF - also called 
steel factor or SF), transforming growth factor-|31 (TGF-pl) and anti-retinoic 
acid. 

Cell cultures of the present invention may be formulated for 
administration to animals by dissociating the cells (e.g., by mechanical 

25 dissociation) and Intimately admixing the cells with a phamiaceutically 
acceptable earner (e.g., phosphate-buffered saline (PBS) solution). Primitive 
cells in such fomiulations may be prepared to carry a heterologous DNA 
sequence into a subject, preferably a vertebrate subject and more preferably 
a wamvblooded vertebrate subject, in the manner described in greater detail 

30 below. 

The primitive cells cultured in accordance with the present invention are 
useful, among other things, as a tool for the study of embryological 
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development (i.e., by labeling the cells with a marker gene and observing their 
distribution after injection in vivo) and the production of transgenic animals. 
They are useful in allowing the application of homologous recombination to the 
production of transgenic animals. 
5 The primitive cells should also be useful for the production of chimeric 

animals, including transgenic chimeric animals. It is contemplated that 
chimeric or transgenic chimeric non-human animals, including particulariy 
chimeric or transgenic chimeric avians (i.e. birds), will be useHil Ibrthe recovery 
of heterologous proteins from the animals. In the case of an avian, the protein 

10 can be recovered directly from the eggs of such chimeric or transgenic 
chimeric avians. Such avians can be thus be used for the production and 
recovery of therapeutic proteins and other polypeptides, including p/iv/ family 
polypeptides themselves. 

The following U.S. Patents pertain to the culturing of stem cells an are 

15 used herein to illuminate the background of the invention, and in particular 
cases, to provide additional details respecting the practice. Each of the 
following U.S. Patent is incorporated herein by reference in its entirety: U.S. 
Patent No. 5,690,926 issued November 25, 1997 to Hogan; U.S. Patent No. 
5,670,372 issued September 23, 1997 to Hogan; U.S. Patent No. 5,537,357 

20 issued September 26, 1995 to Hooan: U.S. Patent No. 5,340,740 issued 
August 23, 1994 to Petitte et al. : U.S. Patent No. 5,656,479 issued August 12, 
1 997 to Petitte etal.: and U.S. Patent No. 5,830,51 0 issued November 3, 1 998 
to Petitte etal. 

Transgenic Animals 

25 It is also contemplated to be within the scope of the present invention 

to prepare a transgenic non-human animal which expresses a p/w/family gene 
of the present invention. A preferred transgenic animal is a mouse. 

The tenii transgene" refers to exogenous genetic material which does 
not naturally form part of the genetic material of an animal to be genetically 

30 altered but can be incorporated into the germ and/or somatic cells of that 
animal by standard transgenic techniques. The temi "heterologous DNA" 
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refers to DNA which has been transferred from one individual animal, species 
or breed to a different individual animal, species or breed. The term 
"transgenic" refers to cells, tissues, embryos, fetuses or animals which cany 
one or more transgenes. The tenri ''chimeric" refers to an embryo, fetus or 
5 animal which consists of two or more tissues of different genetic composition. 

Techniques for the preparation of transgenic animals are known in the 
art. Exemplary techniques are described in U.S. Patent No. 5,489,742 
(transgenic rats); U.S. Patent Nos. 4,736,866, 5,550.31 6, 5.61 4.396, 5,625.1 25 
and 5.648,061 (transgenic mice); U.S. Patent No. 5,573.933 (transgenic pigs); 

10 5,162,215 (transgenic avian species) and U.S. Patent No. 5,741,957 
(transgenic bovine species), the entire contents of each of which are herein 
incorporated by reference. 

With respect to a representative method for the preparation of a 
transgenic mouse, cloned recombinant or synthetic DNA sequences or DNA 

1 5 segments encoding a p/w/family gene product are injected into fertilized mouse 
eggs (e.g. an embryo). The injected eggs are implanted in pseudo pregnant 
females and are grown to term to provide transgenic mice whose cells express 
a p/w/family gene product Preferably, the injected sequences are constructed 
having promoter sequences connected so as to express the desired protein In 

20 gemiline stem cells or other primitive cells of the transgenic mouse. 

Fish represent a category of animals of interest for agricultural and 
ecological reasons. Representative fish species include, but are not limited to. 
treut. salmon, carp, shark, ray. flounder, sole, tilapia, medaka, goldfish, guppy, 
molly, platyfish, swordtail. zebrafish, loach, catfish, and the like. 

25 Representative methods for the preparation of a transgenic fish have been 
described by Ozato et al. Cell Differ., 19:237-244 (1986). Inoue et al. Cell 
Differ. Dev., 29:123-128 (1990). Rokkones et al. J. Comp. Physiol. B. 
158:751-758 (1989), and Guyomard et al. Biochimie, 71:857-863 (1989), 
describing preparation of transgenic medaka, medaka, salmon and treut, 

30 respectively. 
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The term "avian" as used herein refers to any avian species, including 
but not iimited to Gallinacea sp., chicken, turkey, duck, goose, quail and 
pheasant. Chicken is cunrently preferred. 

With respect to an exemplary method for the preparation of a transgenic 
5 avian species, certain donor cell types have been isolated that retain viability 
when injected into recipient embryos. See Etches et al.. Poultry Science 
72:882-887 (1993); Etches etal.. In /I wan Incubation, Chapter 22, Buttenfl^orth 
Publishers (1990); VeninderGibbinsetal., Foi/rt/7 World Congress on Genetics 
Applied to Livestock Production, Edinburgh, (1990); Petitte etal., Development 

10 108, 185-189 (1990)). These studies showed that blastodemnal cells derived 
from Stage X embryos (embryo at oviposition) remained viable when 
transferred to comparable recipient Stage X embryos. 

Thus, the present invention provides a new method of altering the 
phenotype of an animal, such as a bird, and the animals so produced with the 

15 primitive cells cultured in accordance with techniques disclosed herein. The 
method comprises transfecting primitive cells cultured in accordance with the 
methods disclosed herein with the DNA sequence in vitro (e.g., by 
electroporation or transformation with a retroviral vector), and then injecting the 
transfected primitive cells into an embryo, (e.g. an egg containing an embryonic 

20 bird via the yolk sac or onto the chorioallantoic membrane), with the DNA 
sequence being effective to cause a change in phenotype in the embryonic 
animal (e.g., a change in protein expression, a change in growth rate, feed 
efficiency, disease resistance, or a combination of all of these factors). 
Preferably, the primitive cells are also transfected with a piwi femily gene to 

25 fecilitate production of germ cells in the transgenic or chimeric bird and thereby 
also facilitate germline transmission of the DNA sequence of Interest. 

Preferably, in the case of a bird, the egg into which the DNA is 
introduced is incubated to hatch, and the bird so produced is raised to at least 
an age at which the change in phenotype is expressed. It is of no deleterious 

30 consequence if the transfomned embryo and bird is chimeric, so long as a 
physiological response is achieved in the animal after hatch sufficient to evoke 
a measurable phenotypic change measured in any suitable way. By way of 
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additional example, protein expression in the chimera, or transgenic offspring 
thereof produced by gemn line transmission of the DNA sequence, may be 
established through incorporation of an inducible promoter into the Introduced 
DNA sequence coupled with treatment of the chimera, or transgenic offspring 
5 thereof, with an agent which induces the promoter. 

The mechanism of in ovo injection is not critical, but it is preferred that 
the method not unduly damage the tissues and organs of the embryo or the 
extraembryonic membranes sunnounding it so that the treatment will not 
decrease hatch rate. For example, in ovo injections can be earned out by 

1 0 forming a window in a bird egg and then using a micro-pipette or needle for the 
injection, such a 50 /^m-diameter micro-pipette or a 50 //m-diameter needle. 
If desired, the bird egg can be sealed with a substantially bacteria-impermeable 
sealing material such as wax or the like to prevent subsequent entry of 
undesirable bacteria. Suitable sealing material may optionally comprise 

1 5 biologically acceptable antimicrobial compositions. 

The DNA sequence introduced with the primitive cells in accordance 
with the present invention is, in general, a construct comprised of a promoter 
functional in the cells of the candidate animal and a gene encoding a peptide 
or protein operably linked to the promoter. Preferably, the protein or peptide is 

20 physiologically active and capable of producing a phenotypic change in the 
animal. In general, the DNA construct may be a linear DNA sequence 
(introduced Into the primitive cells of the invention by electroporation) or a 
sequence canied by a vector or other suitable canier for transforming the 
primitive cells of the invention using various techniques known in the art, such 

25 as liposomes, calcium phosphate, electroporation, or DMSO. Vectors, as 
discussed herein, may be plasmids, viruses (including retroviruses), and 
phage, whether in native form or derivatives thereof. 

The production of cloned genes, recombinant DNA, vectors, transformed 
host cells, proteins and protein fragments by genetic engineering Is well known. 

30 See e.g., U.S. Pat No. 4,761 ,371 to BeH et al. at Col. 6 line 3 to Col. 9 line 65; 
U.S. Pat. No. 4,877,729 to Clari< et al. at Col. 4 line 38 to Col. 7 line 6; U.S. 
Pat. No. 4,912,038 to Schillino at Col. 3 line 26 to Col, 14 line 12. Protocols for 
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restriction endonuclease digestion, preparation of vectors, DNA purification and 
otiier such procedures are essentially as described in standard cloning 
manuals. See Sambrook et al.. Molecular Cloning, a Laboratory Manual, (2d 
Ed., Cold Spring Hariaor Press, N.Y. (1989)). 
5 As disclosed herein, a vector is a replicable DNA construct used herein 

to either amplify and/or express DNA encoding the gene of interest. A suitable 
expression vector will have controlling elements capable of expressing the 
cloned cDNA or genomic DNA placed in the correct orientation when the vector 
is introduced into the correct host. Such elements typically include but are not 

10 limited to a promoter region which interacts specifically with cellular proteins 
involved in transcription or which may be inducible by administration of an 
inducing agent, enhancer elements which can stimulate transcription many-fold 
from linked heterologous promoters, a splice acceptor and/or donor sequence, 
and tennination and polyadenylation signals. Also required is the sequence for 

15 a ribosome binding site capable of permitting translation and which is operably 
linked to the gene to be expressed. Recently, a muscle-specific promoter has 
been isolated which is positioned upstream of both the skeletal muscle 
structural gene and the essential proximal promoter element and is operably 
associated with each, (Mar and Ordahl, Proc. Natl. Acad, ScL USA 85, 

20 6404-6408 (1988)). Vectors comprise plasmlds, viruses (e.g. adenovirus, 
cytomegalovims), phage, and DNA fragments integratable into the host 
genome by recombination. The vector may optionally replicate and function 
independently of the host genome, or may in some instances integrate into the 
genome itself. 

25 The piwi family genes of the present invention can thus be used to 

create transgenic animals with increased number of primordial germ cells. 
Such animals in mammalian, avian, and fish systems should provide an 
improved platform for germline-mediated gene transfer to create transgenic 
stocks. 

30 The genetically modified animals could express transgenes of 

commercial interest, including those having therapeutic or prophylactic value 
to the animal itself or to its offspring. For example, the transgene of interest 
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can encode a protein or peptide including, but not limited to, growth homrione, 
thyroid releasing homnone (TRH), Marek's MDX, and immunogenic 
recombinant antigens such as that for coccidiosls. 

Altematively, the genetically modified animals can be employed in the 
5 production of therapeutic agents. For example, genetically modified avian 
species that can lay eggs containing dmgs, proteins and antibodies to ward off 
illness that have been produced via the insertion the genes that make the 
proteins into viral and other vectors to get them into the birds, p/w/ family gene 
products can also be produced and isolated in this manner. 
10 In accordance with the present invention , piwi family genes can also be 

inserted into such genetically modified animals to create animals with increased 
number of primordial gemi cells, thereby enhancing the likelihood of gennline 
transmission of the transgene encoding the therapeutic agent due to the 
increased number of primordial germ cells. 

15 Pharmaceutical Compositions 

In a prefenred embodiment, the present invention provides 
phannaceutical compositions comprising a polypeptide or polynucleotide of the 
present invention and a physiologically acceptable carrier. More preferably, a 
pharmaceutical composition comprises a piwi family polypeptide or a 

20 polynucleotide that encodes those polypeptides. 

A composition of the present invention is typically administered 
parenterally in dosage unit fonmulations cx)ntaining standard, well-known 
nontoxic physiologically acceptable earners, adjuvants, and vehicles as 
desired. The tenm "parenteral" as used herein includes intravenous, 

25 intra-muscular, intra-arterial injection, or infusion techniques. 

Injectable preparations, for example sterile injectable aqueous or 
oleaginous suspensions, are fomiulated according to the known art using 
suitable dispersing or wetting agents and suspending agents. The sterile 
injectable preparation can also be a sterile injectable solution or suspension in 

30 a nontoxic parenterally acceptablediluent or solvent, for example, as a solution 
in 1,3-butanediol. 
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Among the acceptable vehicles and solvents that may be employed are 
water, Ringer's solution, and isotonic sodium chloride solution. In addition, 
sterile, fixed oils are conventionally employed as a solvent or suspending 
medium. For this purpose any bland fixed oil can be employed including 
5 synthetic mono- or di-glycerides. In addition, fatty acids such as oleic acid find 
use in the preparation of injectables. 

Prefened earners include neutral saline solutions buffered with 
phosphate, lactate, Tris, and the like. Of course, one purifies the vector 
sufficiently to render it essentially free of undesirable contaminants, such as 
10 defective interfering adenovirus particles or endotoxins and other pyrogens 
such that it does not cause any untoward reactions in the individual receiving 
the vector constmct. A prefenred means of purifying the vector involves the use 
of buoyant density gradients, such as cesium chloride gradient centrifugation. 
A transfected cell can also serve as a carrier By way of example, a liver 
1 5 cell can be removed from an organism, transfected with a polynucleotide of the 
present invention using methods set forth above and then the transfected cell 
retumed to the organism (e.g. injected Intra-vascularly). 



Generation of Antibodies 
In still another embodiment, the present invention provides an antibody 
20 immunoreactive with a polypeptide of the present invention. Preferably, an 
antibody of the invention is a monoclonal antibody. Means for preparing and 
characterizing antibodies are well known in the art (See, e.g., Antibodies A 
Laboratory Manual, E. Howell and D. Lane, Cold Spring Hari3or Laboratory, 
1988). 

25 Briefly, a polyclonal antibody is prepared by immunizing an animal with 

an immunogen comprising a polypeptide or polynucleotide of the present 
invention, and collecting antisera from that immunized animal. A wide range 
of animal species can be used for the production of antisera. Typically an 
animal used for production of anti-antisera is a rabbit, a mouse, a rat, a 

30 hamster or a guinea pig. Because of the relatively large blood volume of 
rabbits, a rabbit is a preferred choice for production of polyclonal antibodies. 
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As is well known in the art, a given polypeptide or polynucleotide may 
vary in its immunogenicity. It is often necessary therefore to couple the 
fmmunogen (e.g., a polypeptide or polynucleotide) of the present invention) 
with a canier. Exemplary and preferred earners are keyhole limpet 
5 hemocyanin (KLH) and bovine serum albumin (BSA). Other albumins such as 
ovalbumin, mouse serum albumin or rabbit serum albumin can also be used as 
canlers. 

Means for conjugating a polypeptide or a polynucleotide to a canier 
protein are well known in the art and include glutaraldehyde, 

10 m-maleimidobencoyl-N-hydroxysuccinimide ester, carbodiimlde and 
bis-biazotized benzidine. 

As is also well known in the art, immunogencity to a particular 
immunogen can be enhanced by the use of non-specific stimulators of the 
immune response known as adjuvants. Exemplary and prefen^ed adjuvants 

15 include complete Freund's adjuvant, incomplete Freund's adjuvants and 
aluminum hydroxide adjuvant. 

The amount of immunogen used of the production of polyclonal 
antibodies varies, inter alia, upon the nature of the immunogen as well as the 
animal used for immunization. A variety of routes can be used to administer 

20 the immunogen, e.g. sut>cutaneous, intramuscular, intradermal, intravenous 
and intraperitoneal. The production of polyclonal antibodies is monitored by 
sampling blood of the immunized animal at various points following 
immunization. When a desired level of Immunogenicity is obtained, the 
immunized animal can be bled and the seaim Isolated and stored. 

25 In another aspect, the present invention contemplates a method of 

producing an antibody Immunoreactlve with a piwi family polypeptide, the 
method comprising the steps of (a) transfecting recombinant host cells with a 
polynucleotide that encodes that polypeptide; (b) culturing the host cells under 
conditions sufficient for expression of the polypeptide; (c) recovering the 

30 polypeptide; and (d) preparing antibodies to the polypeptide. Preferably, the 
piwi family polypeptide possess a biological activity in accordance with the 
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present invention. Even more preferably, the present invention provides 
antibodies prepared according to the method described above. 

A monoclonal antibody of the present invention can be readily prepared 
through use of well-known techniques such as those exemplified in U.S. Pat 
5 No 4,196,265, herein incorporated by reference. Typically, a technique 
involves first immunizing a suitable animal with a selected antigen (e.g., a 
polypeptide or polynucleotide of the present Invention) in a manner sufficient 
to provide an immune response. Rodents such as mice and rats are prefen^ed 
animals. Spleen cells from the immunized animal are then fused with cells of 

10 an immortal myeloma cell. Where the immunized animal is a mouse, a 
prefen-ed myeloma cell is a murine NS-1 myeloma cell. 

The fused spleen/myeloma cells are cultured in a selective medium to 
select fused spleen/myeloma cells from the parental cells. Fused cells are 
separated from the mixture of non-fused parental cells, for example, by the 

1 5 addition of agents that block the de novo synthesis of nucleotides in the tissue 
culture media. Exemplary and preferred agents are aminopterin, methotrexate, 
and azaserine. Aminopterin and methotrexate block de novo synthesis of both 
purines and pyrimidines, whereas azaserine blocks only purine synthesis. 
Where aminopterin or methotrexate is used, the media is supplemented with 

20 hypoxanthine and thymidine as a source of nucleotides. Where azaserine is 
used, the media is supplemented with hypoxanthine. 

This culturing provides a population of hybridomas from which specific 
hybridomas are selected. Typically, selection of hybridomas is perfomned by 
culturing the cells by single-done dilution in microtiter plates, followed by 

25 testing tiie individual clonal supematants for reactivity with an 
antigen-polypeptides. The selected clones can then be propagated indefinitely 
to provide the monoclonal antibody. 

By way of specific example, to produce an antibody of the present 
invention, mice are injected intraperitoneally with between about 1-200 pg of 

30 an antigen comprising a polypeptide of the present invention. B lymphocyte 
cells are stimulated to grow by injecting the antigen in association with an 
adjuvant such as complete Freund's adjuvant (a non-specific stimulator of the 
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immune response containing l^illed Mycobacterium tuberculosis). At some time 
(e.g., at least two weelcs) after the first injection, mice are boosted by Injection 
with a second dose of the antigen mixed with incomplete Freund's adjuvant. 
A few weeks after the second injection, mice are tail bled and the sera 
5 titered by immunoprecipitation against radiolabeled antigen. Preferably, the 
method of boosting and titering is repeated until a suitable titer is achieved. 
The spleen of the mouse with the highest titer is removed and the spleen 
lymphocytes are obtained by homogenizing the spleen with a syringe. 
Typically, a spleen from an immunized mouse contains approximately 5x1 0^ to 

10 2x1 0^ lymphocytes. 

Mutant lymphocyte cells known as myeloma cells are obtained from 
laboratory animals in which such cells have been induced to grow by a variety 
of well-known methods. Myeloma cells lack the salvage pathway of nucleotide 
biosynthesis. Because myeloma cells are tumor cells, they can be propagated 

15 indefinitely in tissue culture, and are thus denominated immortal. Numerous 
cultured cell lines of myeloma cells fn^m mice and rats, such as murine NS-1 
myeloma cells, have been established. 

Myeloma cells are combined under conditions appropriate to foster 
fusion with the nomial antibody-producing cells from the spleen of the mouse 

20 or rat injected with the antigen/polypeptide of the present invention. Fusion 
conditions include, for example, the presence of polyethylene glycol. The 
resulting fused cells are hybridoma ceils. Like myeloma cells, hybridoma cells 
grow indefinitely in culture. 

Hybridoma cells are separated from unfused myeloma cells by culturing 

25 in a selectbn medium such as HAT media (hypoxanthine, aminopterin, 
thymidine). Unfused myeloma cells lack the enzymes necessary to synthesize 
nucleotides from the salvage pathway because they are killed in the presence 
of aminopterin. methotrexate, or azaserine. Unfused lymphocytes also do not 
continue to grow in tissue culture. Thus, only cells that have successfully fused 

30 (hybridoma cells) can grow in the selection media. 

Each of the sun^iving hybridoma cells produces a single antibody. 
These cells are then screened for the production of the specific antibody 
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immunoreactive with an antigen/polypeptide of the present invention. Single 
cell hybridomas are isolated by limiting dilutions of the hybridomas. The 
hybridomas are serially diluted nnany times and, after the dilutions are allowed 
to grow, the supematant is tested for the presence of the monoclonal antibody. 
5 The clones producing that antibody are then cultured in large amounts to 
produce an antibody of the present invention in convenient quantity. 

By use of a monoclonal antibody of the present invention, specific 
polypeptides and polynucleotide of the invention can be recognized as 
antigens, and thus identified. Once identified, those polypeptides and 

10 polynucleotide can be isolated and purified by techniques such as 
antibody-affinity chromatography. In antibody-affinity chromatography, a 
monoclonal antibody is bound to a solid substrate and exposed to a solution 
containing the desired antigen. The antigen is removed from the solution 
through an immunospecific reaction with the bound antibody. The polypeptide 

15 or polynucleotide is then easily removed from the substrate and purified. 

Screening Assays 
In yet another aspect, the present invention contemplates a method of 
screening substancesfortheirability to affect or modulate the biological activity 
of piwi family gene products. The present invention also contemplates a 

20 method of screening substances for their ability to affect or modulate the 
biological activity of p/w/family gene products to thereby affect or modulate the 
growth, proliferation and self-renewing division of stem cells, particularly 
germline stem cells, and proliferation of primordial gemn cells. Utilizing the 
methods and compositions of the present invention, screening assays for the 

25 testing of candidate substances can be derived. A candidate substance is a 
substance which potentially can promote or inhibit the biological activity of a 
p/iy/family gene product by binding, or other intermolecular interaction, with the 
pAv/ family gene product. 

An exemplary method of screening candidate substances for their ability 

30 to modulate piwi family biological activity comprises the steps of: (a) 
establishing replicate test and control samples that comprise a biologically 
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active p/w/ family polypeptide; (b) administering a candidate substance to test 
sample but not the control sample; (c) measuring the biological activity of the 
p/iv/ family polypeptide in the test and the control samples; and (d) determining 
that the candidate substance modulates piwi family biological activity if the 
5 biological activity of the piwi family polypeptide measured for the test sample 
is greater or less than the biological activity of the piwi family polypeptide 
measured for the control sample. The biological activities of piwi family that 
may optionally be examined in connection with a screening assay of the 
present invention comprise modulating the growth, proliferation and self- 
1 0 renewing division of stem cells, particularly germline stem cells, proliferation of 
primordial germ cells or other biological activity in accordance with the present 
invention. 

The replicate test and control samples can further comprise a cell that 
expresses a biologically active p/w/ family polypeptide. The present invention 

15 also contemplates a recombinant cell line suitable for use in the exemplary 
method. A candidate substance identified according to the screening assay 
described herein is contemplated to have the ability to modulate piwi family 
biological activity. Such as candidate compound has utility in the treatment of 
disorders and conditions associated with the biological activity of p/w/ family. 

20 In a cell-free system, the method comprises the steps of establishing a 

control system comprising a p/vw family polypeptide and a ligand wherein tiie 
p/ivi family polypeptide is capable of binding to the ligand; establishing a test 
system comprising the piwi femily polypeptide, the ligand, and a candidate 
compound; measuring ttie binding afTinity of ttie p/w/family polypeptide and the 

25 ligand in the control and tiie test systems; and determining that the candidate 
compound modulates piwi family activity in a cell-firee system if the binding 
affinity measured for tiie test system is less tiian or greater ttian tiie binding 
affinity measured for the control system. An exemplary ligand comprises a 
monoclonal antibody. 

30 A screening assay of tiie present invention may also involve detenmining 

the ability of a candidate substance to modulate, i.e. inhibit or promote piwi 
family biological activity and preferably, to thereby modulate the growth, 
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prollferation and self-renewing division of stem cells, particulariy gennline stem 
cells, proliferation of primordial germ cells or division of other target cells. 
Target cells can be either naturally occuning cells known to contain a 
polypeptide of the present invention or transfomned cell produced in 
5 accordance with a method of transformation set forth hereinbefbre. 

As is well known in the art, a screening assay provides a ceil under 
conditions suitable for testing the modulation of p/iv; family biological activity or 
of growth, proliferation and self-renewing division of stem cells, particularly 
gemiline stem cells, or of proliferation of primordial genu cells. These 

1 0 conditions include but are not limited to pH, temperature, tonicity, the presence 
of relevant metabolic factors (e.g., metal ions such as for example Ca**, growth 
factor, interleukins, or colony stimulating factors), and relevant modifications to 
the polypeptide such as glycosylation or prenylation. It is contemplated that a 
polypeptide of the present invention can be expressed and utilized in a 

15 prokaryotic or eukaryotic cell. The host cell can also be fractionated into 
sub-cellular fractions where the receptor can be found. For example, cells 
expressing the polypeptide can be fractionated into the nuclei, the endoplasmic 
reticulum, vesicles, or the membrane surfaces of the cell. 

in a preferred embodiment, temperature is from about 20''C. to about 

20 SO'^C, more preferably from about ZO^'C. to about 40*'C. and, even more 
preferably about ST^'C. Osmolality is preferably fifom about 5 milliosmols per 
liter (mosm/L) to about 400 mosm/l and, more preferably from about 200 
milliosmols perliterto about 400 mosm/l and, even more preferably from about 
290 nrK>sm/L to about 310 mosm/L. The presence of facbrs can be raquired 

25 for the proper testing of p/w/ family biological activity modulation in specific 
cells. Such factors include, for example, the presence and absence 
(withdrawal) of growth factor, interleukins, or colony stimulating factors. U.S. 
Patent Nos. 5,837.479; 5,645,999; 5,786,152; 5,739,278; and 5,352,660 also 
describe exemplary screening assays, and the entire contents of each are 

30 herein incorporated by reference. 

In one embodiment, a screening assay is designed to be capable of 
discriminating candidate substances having selective ability to interact with one 
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or more of the polypeptides of the present Invention but which polypeptides are 
without a substantially overlapping activity with another of those polypeptides 
identified herein. Exemplary assays including genetic screening assays and 
molecular biology screens such as a yeast two-hybrid screen which will 
5 effectively identify p/iVHnteracting genes important for p/wZ-mediated cell 
growth, proliferation and self-renewing division processes. One version of the 
yeast two-hybrid system has been descrit)ed (Chien et al., 1991, Prvc. Natl. 
Acad. ScL USA, 88:9578-9582) and is commercially available from Clontech, 
Palo Alto, California. 

10 Therapeutic Methods 

As used herein, the tenns "p/w/family activity" and "p/Vw family biological 
activity" are meant to be synonymous and are meant to refer to any biological 
activity of a piwi family polypeptide. Exemplary biological activities of piwi 
family comprise activity in the modulation of growth, pnDliferation and self- 

1 5 renewing division of stem cells (including germline and somatic stem cells), cell 
proliferation (including activity in embryonic development as described in the 
Examples), proliferation of primordial gemn cells or other biological activity In 
accordance with the present invention. The biological activity can be 
accomplished by endogenous piwi family polypeptides or by piwi family 

20 polypeptide administered to a subject, indeed, an isolated and purified piwi 
family polypeptide, recombinant piwi family polypeptide, and/or piwi family 
analog or peptidomimettc, each prepared as described herein, can 
administered to a subject to impart p/w/ family biological activity In the subject 
and to treat a disorder associated with piwi family biological activity In the 

25 subject. In such case the Imparted p/Vw family biological activity comprises a 
p/w/ family biological activity in accordance with the therapeutic methods of the 
present invention. 

With respect to the therapeutic methods of the present invention, any 
subject can be treated, including animal and plant subjects. A preferred 

30 subject is a vertebrate subject. Prefened examples of vertebrates include fish 
and wamri-blooded vertebrates. A preferred example of a wann-blooded 
vertebrate is a mammal. A preferred example of a mammal is a human. 
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Additionally, as used herein and in the claims, the iem "patient" is 
contemplated to include both human and animal patients, and thus, veterinary 
therapeutic uses are contemplated in accordance with the present invention. 
Accordingly, the methods of the present invention are particularly 
5 contemplated to be useful in the treatment of warm-blooded vertebrates. 
Therefore, the Invention concerns mammals and birds. 

Contemplated is the treatment of mammals such as humans, as well as 
those mammals of importance due to being endangered (such as Siberian 
tigers), of economical importance (animals raised on farms for consumption by 

10 humans) and/or social importance (animals kept as pets or in zoos) to humans, 
for instance, carnivores other than humans (such as cats and dogs), swine 
(pigs, hogs, and wild boars), ruminants (such as cattle, oxen, sheep, giraffes, 
deer, goats, bison, and camels), and horses. Also contemplated is the 
treatment of birds, including the treatment of those kinds of birds that are 

1 5 endangered, kept in zoos, as well as fowl, and more parttculariy domesticated 
fowl, i.e., poultry, such as turkeys, chickens, ducks, geese, guinea fowl, and the 
like, as they are also of economical importance to humans. Thus, 
contemplated is the treatment of livestock, including, but not limited to, 
domesticated swine (pigs and hogs), mminants, horses, poultry, and the like. 

20 Modulation of p/w/ family biological activity 

In one embodiment, a therapeutic method according to the present 
invention comprises administering to a subject a substance that inhibits or 
promotes piwi family biological activity. Such a substance may be identified 
according to the screening assay set forth above. The method comprises 

25 treating a subject suffering from a disorder assodated with or mediated by piwi 
family biological activity by administering to the subject an effective pivw family 
biological activity-modulating amount of a substance identified according to a 
screening assay described above. By the term "modulating", it is contemplated 
that the substance can either promote or inhibit the biological activity of piwi 

30 family polypeptides, depending on the disorderto be treated. For example, the 
administering of antibodies against a chosen PIWI family protein (e.g. HIWI, 
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MIWI, etc.) provides treatment of germline tumor or other developmental or 
tissue defects caused by p/vW family-mediated mechanisms. 

Insofar as a p/vw family biological activity modulator can take the form 
of a polypeptide or of an anti-p/w/ family monoclonal antibody, or fragment 
5 thereof, it is to be appreciated that the potency, and therefore an expression 
of a "therapeutically effective" amount can vary. However, as shown by the 
present assay methods, one skilled in the art can readily assess the potency 
of a candidate p/m family biological activity modulator of this invention. A piwi 
family biological activity modulator can be measured by a variety of means 

1 0 including through the use of a responsive reporter, which drives expression of 
a reporter gene; interaction of piwi family polypeptides with a monoclonal 
antibody as described herein; and the like assays. 

A preferred piwi family biological activity modulator has the ability to 
substantially interact with p/wtf family in solution at modulator concentrations of 

15 less than one (1) micro molar (pM), preferably less than 0.1 pM, and more 
preferably less than 0.01 pM. By "substantially" is meant that at least a 50 
percent modulation in p/vw family biological activity is observed in the presence 
of the piwi family biological activity modulator, and a 50% modulation is referred 
to herein as an IC50 value. 

20 A therapeutically effective amount of a piwi family biological activity 

modulator of this invention in the fomn of a monoclonal antibody, or fragment 
thereof, is typically an amount such that when administered in a physiologically 
tolerable composition is sufRcient to achieve a plasma concentration of from 
about 0.01 microgram (ug) per milliliter (ml) to about 1 00 ug/ml, preferably from 

25 about 1 ug/ml to about 5 ug/ml, and usually about 5 ug/ml. 

A therapeutically effective amount of a piwi family biological activity 
modulator of this invention in the form of a polypeptide is typically an amount 
of polypeptide such that when administered In a physiologically tolerable 
composition is sufficient to achieve a plasma concentration of from about 0.001 

30 microgram (pg) per milliliter (ml) to about 1 0 pg/ml, preferably firom about 0.05 
pg/ml to about 1.0 ug/ml. 
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The monoclonal antibodies or polypeptides of the invention can be 
administered parenterally by injection or by gradual infusion over time. 
Although the tissue to be treated can typically be accessed in the body by 
systemic administration and therefore most often treated by intravenous 
5 administration of therapeutic compositions, other tissues and delivery means 
are contemplated where there is a likelihood that the tissue targeted contains 
the target molecule. Thus, monoclonal antil>odies or polypeptides of the 
invention can be administered intravenously, intraperitoneally, intramuscularly, 
subcutaneously, intra-cavity, transdermally, and can be delivered by peristaltic 
10 means. 

The therapeutic compositions containing a monoclonal antibody or a 
polypeptide of this invention are conventionally administered intravenously, as 
by injection of a unit dose, for example. The iem "unit dose" when used in 
reference to a therapeutic composition of the present invention refers to 
15 physically discrete units suitable as unitary dosage for the subject, each unit 
containing a predetennined quantity of active material calculated to produce 
the desired therapeutic effect in association with the required diluent; I.e., 
canier or vehicle. 

The compositions are administered in a manner compatible with the 
20 dosage formulation, and in a therapeutically effective amount. The quantity to 
be administered depends on the subject to be treated, capacity of the subject's 
system to utilize the active ingredient, and degree of therapeutic effect desired. 
Precise amounts of active ingredient required to be administered depend on 
the judgement of the practitioner and are peoiliarto each individual. However, 
25 suitable dosage ranges for systemic application are disclosed herein and 
depend on the route of administration. Suitable regimes for administration are 
also variable, but are typified by an initial administration followed by repeated 
doses at one or more hour intervals by a subsequent injection or other 
administration. Alternatively, continuous intravenous infusbn sufficient to 
30 maintein concentrations in the blood in the ranges specified for in vivo 
therapies are contemplated. Polypeptide modulators can also be modified as 
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described herein below with respect to biologically active piwi family proteins 
and protein therapy methods using the same. 

Monoclonal Antibodies 
The present invention describes, in one embodiment, piwi family 
5 nrKxlulators in the fomi of monoclonal antibodies which immunoreact with a piwi 
family polypeptide and bind the piwi family polypeptide to modulate biological 
activity as described herein. The invention also describes above cell lines 
which produce the antibodies, methods for producing the cell lines, and 
methods for producing the monoclonal antibodies. 

10 The temi "antibody or antibody molecule" in the various grammatical 

forms is used herein as a collective noun that refers to a population of 
immunoglobulin molecules and/or immunologically active portions of 
immunoglobulin molecules, I.e., molecules that contain an antibody combining 
site or paratope. An "antibody combining site" is that structural portion of an 

15 antibody molecule comprised of heavy and light chain variable and 
hypervariable regions that specifically binds antigen. 

Exemplary antibodies for use in the present invention are intact 
immunoglobulin molecules, substantially intact immunoglobulin molecules, 
single chain immunoglobulins or antibodies, those portions of an 

20 immunoglobulin molecule that contain the paratope, including those portions 
known in the art as Fab, Fab', F(ab')2 and F(v), and also referred to as 
antibody fragments. Indeed, it is contemplated to be within the scope of the 
present invention that a monovalent modulator may optionally be is used in the 
present method. Thus, the terms "nrKxiulate", "modulating", and "modulator^ 

25 are meant to be construed to encompass such promotton. 

The phrase "monoclonal antibody" in its various gi^mmatical forms 
refers to a population of antit)ody molecules that contain only one species of 
antibody combining site capable of immunoreacting with a particular epitope. 
A monoclonal antibody thus typically displays a single binding affinity for any 

30 epitope with which it immunoreacts. A monoclonal antibody may therefore 
contain an antibody molecule having a plurality of antibody combining sites, 
each immunospecific for a different epitope, e.g., a bispecific monoclonal 
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antibody. Methods of producing a monoclonal antit)ody, a hybrldoma cell, or 
a hybridoma cell culture are described above. 

It is also possible to detemiine, without undue experimentation, if a 
monoclonal antibody has the same (i.e., equivalent) specificity (immunoreaction 
5 characteristics) as a monoclonal antibody of this invention by ascertaining 
whether the fbmier prevents the latter from binding to a preselected target 
molecule. If the monoclonal antibody being tested competes with the 
monoclonal antibody of the invention, as shown by a decrease in binding by the 
monoclonal antibody of the invention in standard competition assays for 
10 binding to the target molecule when present in the solid phase, then it is likely 
that the two monoclonal antibodies bind to the same, or a closely related, 
epitope. 

Still another way to determine whether a monoclonal antibody has the 
specificity of a monoclonal antibody of the invention is to pre-incubate the 

15 monoclonal antibody of the invention with the target molecule with which it is 
normally reactive, and then add the monoclonal antibody being tested to 
determine if the monoclonal antibody being tested is inhibited in its ability to 
bind the target molecule. If the monoclonal antibody being tested is inhibited 
then, in all likelihood, it has the same, or functionally equivalent, epitopic 

20 specificity as the monoclonal antibody of the invention. 

An additional way to determine whether a monoclonal antibody has the 
specifidty of a monoclonal antibody of the invention is to detenmine the amino 
acid residue sequence of the CDR regions of the antibodies in question. 
"CDRs" (complementarity detenmining regions) mean the three subregions of 

25 the light or heavy chain variable regions which have hypervariable sequences 
and fomn loop staictures that are primarily responsible for making direct contact 
with antigen. Antibody molecules having identical, or functionally equivalerrt, 
amino acid residue sequences in their CDR regions have the same binding 
specificity. Methods for sequencing polypeptides are well known in the art. 

30 The immunospecrficity of an antit)ody, its target molecule binding 

capacity, and the attendant afRnity the antibody exhibits for the epitope, are 
defined by the epitope with which the antibody immunoreacts. The epitope 
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specificity is defined at least in part by the amino acid residue sequence of the 
variable region of the heavy chain of the immunoglobulin that comprises the 
antibody, and in part by the light chain variable region amino acid residue 
sequence* Use of the temis "having the binding specificity of or "having the 
5 binding preference of indicates that equivalent monoclonal antibodies exhibit 
the same or similar immunoreaction (binding) characteristics and compete for 
binding to a preselected target molecule. 

Humanized monoclonal antibodies offer particular advantages over 
murine monoclonal antibodies, particulariy insofar as they can be used 

10 therapeutically in humans. Specifically, human antibodies are not cleared from 
the circulation as rapidly as "foreign" antigens, and do not activate the immune 
system in the same manner as foreign antigens and foreign antibodies. 
Methods of preparing "humanized" antibodies are generally well known in the 
art, and can readily be applied to the antibodies of the present invention. Thus, 

1 5 the invention contemplates, in one embodiment, a monoclonal antibody of this 
Invention that is humanized by grafting to introduce components of the human 
immune system without substantially interfering with the ability of the antibody 
to bind antigen. 

The use of a molecular cloning approach to generate antibodies, 
20 particulariy monodonal antibodies, and more particulariy single chain 
monoclonal antibodies, is also contemplated. The production of single chain 
antibodies has been described in the art, see e.o.. U.S. Patent No. 5,260,203, 
the contents of which are herein incorporated by reference. For this, 
combinatorial immunoglobulin phagemid libraries are prepared from RNA 
25 isolated from the spleen of the immunized animal, and phagemids expressing 
appropriate antibodies are selected by panning on endothelial tissue. The 
advantages of this approach over conventional hybridoma techniques are that 
approximately 10"* times as many antibodies can be produced and screened 
in a single round, and that new specificities are generated by H and L chain 
30 combination in a single chain, which further increases the chance of finding 
appropriate antibodies. Thus, an antibody of the present invention, or a 
"derivative" of an antibody of the present invention pertains to a single 
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polypeptide chain binding molecule which has binding specificity and affinity 
substantially similar to the binding specificity and affinity of the light and heavy 
chain aggregate variable region of an antibody described herein. 

Other Modulators 

5 Given the disclosure of the p/vWfamily activity in tissues herein, it is also 

contemplated that as yet undefined chemical compounds may be used to 
modulate p/iv/ family activity in tissues in accordance with the methods of the 
present invention. The identification of such compounds is facilitated by the 
description of screening assays directed to piwi family activity in tissues 
10 presented above. 

Gene Therapy 

In accordance with the present invention, p/w/ family genes can be used 
as a tool of gene therapy in humans and mammalian, avian, and fish systems 
to increase the number of stem cells and to increase proliferation of primordial 

1 5 gemi cells in vh/o. 

For example, over-proliferation of malignant stem cells is the leading 
cause of cancer while under-proliferation of stem cells or stem-like progenitor 
cells leads to tissue dystrophy, anemia, immunodeficiency, and male infertility. 
The crucial role of stem cells has long been attributed to their ability to self- 

20 renew and to generate immense number of specialized cells on demand. 
Thus, p/iv/ family genes can be used for gene therapy in accordance with the 
present invention. Exemplary gene therapy methods, induding liposomal 
transfection of nucleic adds into host cells, are described in U.S. Patent Nos. 
5.279,833; 5,286,634; 5,399,346; 5,646,008; 5,651.964; 5,641,484; and 

25 5,643,567, the contents of each of which are herein incorporated by reference. 

Briefly, p/w/ family gene therapy directed toward modulation of growth, 
proliferation and self-renewing division in a target ceil is described. Target cells 
include but are not limited to gemnline stem cells, primordial germ cells and 
cancerous or tumorous ceils. In one embodiment, a therapeutic method of the 

30 present invention contemplates a method for modulating of growth, prolrferation 
or self-renewing division in a cell comprising the steps of: (a) delivering to the 
cell an effective amount of a DNA molecule comprising a polynucleotide that 



wo 00/32039 



-58- 



PCT/US99/28764 



encodes a p/w; family polypeptide that modulates growth, proliferation or self- 
renewing division; and (b) maintaining the cell under conditions sufTicient Ibr 
expression of said polypeptide. 

In a preferred embodiment, the p/iv/ family polypeptide is HIWI or MIWI. 
5 Delivery is preferably accomplished by injecting the DNA molecule into the cell. 
Where the cell is in a subject delivering is preferably administering the DNA 
molecule into the circulatory system of the subject. In a preferred embodiment, 
administering comprises the steps of: (a) providing a vehicle that contains the 
DNA molecule; and (b) administering the vehicle to the subject. 

10 A vehicle is preferably a cell transfomied or transfected with the DNA 

molecule or a transfected cell derived from such a transfomied or transfected 
cell. An exemplary and preferred transfomied or transfected cell is a 
spermatogonial cell or a tumor cell from the tumor being treated. Techniques 
for transforming or transfecting a cell with a DNA molecule of the present 

1 5 invention are set forth above. 

Alternatively, the vehicle Is a vims or an antibody that specifically infects 
or immunoreacts with an antigen of the tumor. Retroviruses used to deliver the 
oonstmcts to the host target tissues generally are viruses in which the 3-LTR 
(linear transfer region) has been inactivated. That is, these are enhanceriess 

20 3-LTR's, often referred to as SIN (self-inactivating viruses) because after 
productive infection into the host cell, the 3-LTR is transferred to the 5'-end 
and both viral LTR's are inactive with respect to transcriptional activity. A use 
of these viruses well known to those skilled in the art is to clone genes for 
which the regulatory elements of the cloned gene are inserted in the space 

25 between the two LTR*s. An advantage of a viral infection system Is that it allows 
for a very high level of Infection into the appropriate recipient cell. 

Antibodies have been used to target and deliver DNA molecules. An 
N-terminal modified poly-L-lyslne (NPLL)-antlbody conjugate readily forms a 
complex with plasmid DNA. A complex of monoclonal antibodies against a cell 

30 surface thrombomodulin conjugated with NPLL was used to target a foreign 
plasmid DNA to an antigen-expressing mouse lung endothelial cell line and 
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mouse lung. Those targeted endothelial cells expressed the product encoded 
by tiiat foreign DNA, 

It is also envisioned that this embodiment of the present inventton can 
be practiced using alternative viral or phage vectors, including retroviral 
5 vectors, adenoviral vectors and vaccinia viruses whose genome has been 
manipulated in alternative ways so as to render the virus non-patiiogenic. 
Methods for creating such a viral mutation are set forth in detail in U.S. Pat No. 
4,769,331, Incorporated herein by reference. 

By way of specific example, hwi, miwi, and their homologs from warm- 
10 blooded vertebrates are introduced into isolated spermatogonia! cells or other 
relevant cells. The re-injection of the transgene-carrying cells into the testis or 
other relevant tissues provides a treatment for male infertility or other relevant 
diseases in human and animals. 

Protein Therapy 

15 In another embodiment, the direct introduction of the p/iv/family proteins 

into a diseased tissue is contemplated to provide a therapeutic effect in human 
and animals. Such a therapeutic method comprises administering to a subject 
a therapeutic composition which comprises a biologically active piwi family 
polypeptide of the present invention in amount effective to modulate a piwh 

20 mediated biological activity in tiie subject. 

In one embodiment, a polypeptide for use in such a composition 
comprises no more than about 100 amino acid residues, preferably no more 
than about 60 residues, more preferably no more than about 30 residues. 
Peptides can be linear or cyclic. 

25 A subject polypeptide includes any analog, fragment or chemical 

derivative of a piwi family polypeptide. Such a polypeptide can be subject to 
various changes, substitutions, insertions, and deletions where such changes 
provide for certain advantages In its use. In this regard, an piwi family 
polypeptide for use in a therapeutic method of the present invention 

30 comesponds to, rather than is identical to, the sequence of a native piwi family 
polypeptide where one or more changes are made and it retains the ability to 
function as a piwi lamWy polypeptide in one or more of the assays as defined 
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herein. Thus, a polypeptide can be In any of a variety of forms of peptide 
derivatives, that include amides, conjugates with proteins, cyclized peptides, 
polymerized peptides, analogs, fragments, chemically modified peptides, and 
the like derivatives. 

5 The term "analog" includes any polypeptide having an amino acid 

residue sequence substantially identical to a sequence of an endogenous piwi 
family polypeptide in which one or more residues have been conservatively 
substituted with a functionally similar residue and which displays the p/iv/ family 
biological activity as described herein. Examples of conservative substitutions 

10 include the substitution of one non-polar (hydrophobic) residue such as 
isoleucine, valine, leucine or methionine for another; the substitution of one 
polar (hydrophilic) residue for another such as between arginine and lysine, 
between giutamine and asparagine, between glycine and serine; the 
substitution of one basic residue such as lysine, arginine or histidine for 

1 5 another; or the substitution of one acidic residue, such as aspartic acid or 
glutamic acid for another. Such substitutions are described in detail above with 
respect to the isolated and purified piwi family polypeptide of the present 
invention. 

The phrase "conservative substitution" also includes the use of a 
20 chemically derivatized residue in place of a non-derivatlzed residue provided 
that such polypeptide displays the requisite biological activity. 

"Chemical derivative" refers to a subject polypeptide having one or more 
residues chemically derivatized by reaction of a functional side group. Such 
derivatized molecules include for example, those molecules in which fifee 
25 amino groups have been derivatized to form amine hydrochlorides, p-toluene 
sulfonyl groups, cart^obenzoxy groups, t-butyloxycart)onyl groups, chloroacetyl 
groups or fonmyl groups. Free cart)oxyl groups may be derivatized to form 
salts, methyl and ethyl esters or other types of esters or hydrazides. Free 
hydroxyl groups may be derivatized to fomn O-acyl or 0-alkyl derivatives. The 
30 imidazole nitrogen of histidine may be derivatized to fomn N-im-benzylhistidine. 
Also included as chemical derivatives are those peptides which contain one or 
more naturally occumng amino acid derivatives of the twenty standard amino 
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acids. For examples: 4-hydroxyprolme may be substituted for proline; 5- 
hydroxylyslne may be substituted for lysine; 3-methylhistldlne may be 
substituted for histidine; homoserine may be substituted for serine; and 
ornithine may be substituted for lysine. Polypeptides of the present invention 
5 also include any polypeptide having one or more additions and/or deletions or 
residues relative to the sequence of a polypeptide whose sequence is shown 
herein, so long as the requisite activity is maintained. 

The temi "fragment" refers to any subject polypeptide having an amino 
acid residue sequence shorter than that of a polypeptide whose amino acid 

10 residue sequence is shown herein. 

When a polypeptide of the present invention has a sequence that is not 
identical to the sequence of a p/iv/family endogenous polypeptide, it is typically 
because one or more conservative or non-conservative substitutions have 
been made, usually no more than about 30 number percent, and preferably no 

15 more than 10 number percent of the amino acid residues are substituted. 
Additional residues may also be added at either terminus of a polypeptide for 
the purpose of providing a linker" by which the polypeptides of this invention 
can be conveniently affixed to a label or solid matrix, or carrier. Labels, solid 
matrices and carriers that can be used with the polypeptides of this invention 

20 are described elsewhere herein. 

Amino acid residue linkers are usually at least one residue and can be 
40 or more residues, more often 1 to 10 residues, but do not fbmi p/w/ family 
polypeptide epitopes. Typical amino acid residues used for linking are tyrosine, 
cysteine, lysine, glutamic and aspartic acid, or the like. In addition, a subject 

25 polypeptide can differ, unless othenvise specified, from the natural sequence 
of a piiv/ family polypeptide by the sequence being modified by ten7iinal-NH2 
acylation, e.g., acetylation, or thioglycolic acid amidation, by tenninal- 
carboxylamidation, e.g.. with ammonia, methylamine, and the like tenmlnal 
modifications. Terminal modifications are useful, as is well known, to reduce 

30 susceptibility by proteinase digestion, and therefore serve to prolong half life 
of the polypeptides In solutions, particulariy biological fluids where proteases 
may be present. In this regard, polypeptide cyclization is also a useful terminal 
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modification, and is particularly preferred also because of the stable structures 
fornned by cyclization and in view of the biological activities observed for such 
cyclic peptides as described herein. 

Any peptide of the present invention may be used in the fomi of a 
5 phamiaceutically acceptable salt. Suitable acids which are capable of the 
peptides with the peptides of the present invention include inorganic acids such 
as trifluoroacetic acid (TFA), hydrochloric acid (HCl), hydrobromic acid, 
perchloric acid, nitric acid, thiocyanic acid, sulfuric acid, phosphoric acetic acid, 
propionic acid, glycolic acid, lactic acid, pyruvic acid, oxalic acid, malonic acid, 

10 succinic acid, maleic acid, fumaric acid, anthranilic acid, cinnamic acid, 
naphthalene sulfonic acid, sulfanilic acid or the like. HCl and TFA salts are 
particularly prefen^ed. 

Suitable bases capable of forming salts with the peptides of the present 
invention include inorganic bases such as sodium hydroxide, ammonium 

1 5 hydroxide, potassium hydroxide and the like; and organic bases such as mono- 
di- and tri-alkyi and aryl amines (e.g. triethylamine, diisopropyl amine, methyl 
amine, dimethyl amine and the like), and optionally substituted ethanolamines 
(e.g. ethanolamine, diethanolamine and the like). 

A peptide of the present invention, also referred to herein as a subject 

20 polypeptide, can be synthesized by any of the techniques that are known to 
those skilled in the polypeptide art. including recombinant DNA techniques. 
Synthetic chemistry techniques, such as a solid-phase Memfield-type 
synthesis, are prefenred for reasons of purity, antigenic specificity, freedom 
from undesired side products, ease of production and the like. An excellent 

25 summary of the many techniques available can be found in Steward et al., 
"Solid Phase Peptide Synthesis^ W. H. Freeman Co., San Francisco, 1969; 
Bodanszky, et al., "Peptide Synthesis". John Wiley & Sons, Second Edition, 
1976; J. Meienhofer, "Honnonal Proteins and Peptides", Vol. 2, p. 46, 
Academic Press (New Yoric), 1983; Menifield, /IdvEnzymo/, 32:221-96, 1969; 

30 Fields et al., Int J. Peptide Protein Res., 35:161-214. 1990; and U.S. Pat, No. 
4,244,946 for solid phase peptide synthesis, and Schroder et al., 'The 
Peptides", Vol. 1, Academic Press (New Yori<). 1965 for classical solution 
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synthesis, each of which is incorporated herein by reference. Appropriate 
protective groups usable in such synthesis are described in the above texts 
and in J. F. W. McOmle. "Protective Groups in Organic Chemistry", Plenum 
Press, New Yoric, 1973, which is incorporated herein by reference. 
5 In general, the solid-phase synthesis methods contemplated comprise 

the sequential addition of one or more amino acid residues or suitably 
protected amino acid residues to a growing peptide chain. Normally, either the 
amino or carboxyl group of the first amino acid residue is protected by a 
suitable, selectively removable protecting group, a different, selectively 

1 0 removable protecting group is utilized for amino acids containing a reactive side 
group such as lysine. 

Using a solid phase synthesis as exemplary, the protected or derivatized 
amino acid is attached to an inert solid support through its unprotected carboxyl 
or amino group. The protecting group of the amino or carboxyl group is then 

15 selectively removed and the next amino acid in the sequence having the 
complimentary (amino or carboxyl) group suitably protected is admixed and 
reacted under conditions suitable for forming the amide linkage with the residue 
already attached to the solid support. The protecting group of the amino or 
carboxyl group is then removed from this newly added amino acid residue, and 

20 the next amino acid (suitably protected) is then added, and so forth. After all 
the desired amino acids have been linked in the proper sequence, any 
remaining temninal and side group protecting groups (and solid support) are 
removed sequentially or concun^ntly, to afford the final linear polypeptide. 
The resultant linear polypeptides prepared for example as described 

25 above may be reacted to fonn their corresponding cyclic peptides. An 
exemplary method for cyclizing peptides is described by Zimmer et al.. 
Peptides 1992. pp. 393^94, ESCOM Science Publishers. B. V„ 1993. 
Typically, tertbutoxycarbonyl protected peptide methyl ester is dissolved in 
methanol and sodium hydroxide solution are added and the admixture is 

30 reacted at 20°C to hydrolytically remove the methyl ester protecting group. 
After evaporating the solvent, the tertbutoxycart^onyl protected peptide is 
extracted with ethyl acetate from acidified aqueous solvent. The 
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tertbutoxycarbonyi protecting group is ihen removed under mildly acidic 
conditions in dioxane cosotvent. The unprotected linear peptide witli free amino 
and carboxy temiini so obtained is converted to its conresponding cyclic peptide 
by reacting a dilute solution of the linear peptide, in a mixture of 
5 dtchloromethane and dimethylformamide, with dicyclohexylcarbodiimide in the 
presence of 1-hydroxybenzotriazole and N-methylmorpholine. The resultant 
cyclic peptide is then purified by chromatography. 

Antisense Oliqonulceotide Therapy 
In another embodiment, the administering of antisense RNA against a 

10 chosen p/w/ family gene (e.g, hiwi, miwi, etc.) provides treatment of germline 
tumor or other developmental or tissue defects caused by p/iv/ family-mediated 
mechanisms. In accordance with the present invention such a therapeutic 
method may alternatively comprise promoting or Inhibiting piwi family in a 
subject by administering an efTective amount of a substance that inhibits or 

15 promotes expression of a p/iv; family-encoding nucleic acid segment in the 
subject. Examples of such a substance, include, for example, an antisense 
oligonucleotide derived from SEQ ID N0s:1, 3 or 5. Therapeutic methods 
utilizing antisense oligonucleotides have been described in the art, for example 
in U.S. Patent Nos. 5,627,158 and 5*734,033, the contents of each of which 

20 are herein Incorporated by reference. 

Fomiulation of Therapeutic Compositions 
The p/m family biological activity modulating substances, gene therapy 
vectors, biologically activity piwi famWy gene products, and substances that 
inhibit or promote expression of a p/wi family-encoding nucleic acid segment 

25 described above are adapted for administiration as a phannaceutical 
compositions as described above. Additional formulation and dose preparation 
techniques have been described in the art, see for example, those described 
in U.S. Patent No. 5,326,902 Issued to Seipp et al. on July 5, 1994, U.S. Patent 
No. 5,234,933 issued to Mamett et aL on August 10, 1993, and PCT 

30 Publication WO 93/25521 of Johnson et aL published December 23, 1 993, the 
entire contents of each of which are herein incorporated by reference. 
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For the purposes described above, the identified substances may 
normally be administered systemically or partially, usually by oral or parenteral 
administration. The doses to be administered are detemiined depending upon 
age, body weight, symptom, the desired therapeutic effect, the route of 
5 administration, and the duration of the treatment etc. In a human adult, the 
doses per person per administration are generally between 1 mg and 500 mg, 
by oral administration, up to several times per day, and between 1 mg and 100 
mg, by parenteral administration up to several times per day. Since the doses 
to be used depend upon various conditions, as mentioned above, there may 
10 be a case In which doses are lower than or greater than the ranges specified 
above. 

Solid compositions for oral administration include compressed tablets, 
pills, dispersible powders, capsules, and granules. In such compositions, one 
or more of the active substance(s) is or are, admixed with at least one inert 

15 diluent (lactose, mannitol, glucose, hydroxypropylcellulose, microcrystalline 
cellulose, starch, polyvinylpynDtidone, magnesium metasilicate alminate, etc.). 
The compositions may also comprise, as is nomrial practice, additional 
substances other than inert diluents: e.g. lubricating agents (magnesium 
stearate, etc.), disintegrating agents (cellulose, calcium glycolate etc.), and 

20 assisting agent for dissolving (glutamic acid, aspartic acid, etc.) stabilizing 
agent (lactose etc.). The tablets or pills may, if desired, be coated with gastric 
or enteric material (sugar, gelatin, hydroxypropylcellulose or 
hydroxypropylmethyi cellulose phthalate, etc.)- Capsules include soft ones and 
hard ones. 

25 Liquid compositions for oral administration include phamnaceutically- 

acceptable emulsions, solutions, suspensions, syrups and eli}drs. In such 
compositions, one or more of the active $ubstance(s) is or are admixed with 
inert diluent(s) commonly used in the art (purified water, ethanoi etc.). Besides 
inert diluents, such compositions may also comprise adjuvants (wetting agents, 

30 suspending agents, etc.), sweetening agents, flavoring agents, perfuming 
agents and preserving agents. 



wo 00/32039 



PCT/US99y28764 



-66- 

Other compositions for oral administration include spray compositions 
which may be prepared by known methods and which comprise one or more 
of the active substance(s). Spray compositions may comprise additional 
substances other than inert diluents: e.g. preserving agents (sodium sulfite. 
5 etc.), isotonic buffer (sodium chloride, sodium citrate, citric acid, etc.). For 
preparation of such spray compositions, for example, the method described in 
U.S. Patent Nos. 2,868,691 or 3.095,355 may be used. 

Injections for parenteral administration include sterile aqueous or non- 
aqueous solution, suspensions and emulsions. In such compositions, one or 

10 more of active substance(s) is or are admixed with at least one inert aqueous 
diluent(s) (distilled water for injection, physiological salt solution etc.) or inert 
non-aqueous diluent(s) (propylene glycol, polyethylene glycol, olive oil, ethanol, 
POLYSOLBATE 80® etc.). Injections may comprise additional otherthan inert 
diluents: e.g, preserving agents, wetting agents, emulsifying agents, dispersing 

1 5 agents, stabilizing agents (lactose, etc.), assisting agents such as for dissolving 
(glutamic acid, aspartic acid, etc.). They may be sterilized, for example, by 
filtration through a bacteria-retaining filter, by incorporation of sterilizing agents 
in the compositions or by in^adiation. They also be manufactured in the form 
of sterile solid compositions, for example, by freeze-drying, and which can be 

20 dissolved in sterile water or some otiier sterile diluents for injection immediately 
before use. 

Other compositions for administration include liquids for extemal use, 
and endenmic linaments (ointment, etc.), suppositories and pessaries which 
comprise one or more of the active substance(s) and may be prepared by 
25 known methods. 

Detecting a Polvnudeotide or a Polvpeptide of the Present Invention 
Altematively, the present invention provides a method of detecting a 
polypeptide of the present invention, wherein the method comprises 
immunoreacting the polypeptides with antibodies prepared according to the 
30 method described above to form antibody-polypeptide conjugates, and 
detecting the conjugates. 
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In yet another embodiment, the present invention contemplates a 
method of detecting messenger RNA transcripts that encode a polypeptide of 
the present invention, wherein the method comprises hybridizing the 
messenger RNA transcripts with polynucleotide sequences that encode the 
polypeptide to form duplexes; and detecting the duplex. Alternatively, the 
present invention provides a method of detecting DNA molecules that encode 
a polypeptide of the present Invention, wherein the method comprises 
hybridizing DNA molecules withapolynucleotide that encodes that polypeptide 
to form duplexes; and detecting the duplexes. 

The detection and screening assays disclosed herein can be used as 
a prognosis tool. hiwi. mim and their mammalian homologous genes as well 
as their protein products can be readily used in clinical setting as a prognostic 
Indicator for screening infertility, gennline cancer, or other heritable p/iv/-related 
diseases in human and animals. 

The detection and screening assays disclosed herein can be also used 
as a part of a diagnostic method, hiwi, miwi, and their mamallan homologs as 
well as their proteins can be readily used in clinical practice to diagnose the 
cause of infertility, testicular cancer, or other heritable p/wArelated diseases in 
human and animals. 

Screening Assavs for a Polvpeotide of the 
Present Invention 
The present invention provides a method of screening a biological 
sample forthe presence of apAv/femily polypeptide. Preferably, thep/Vwfemlly 
polypeptide possesses a biological activity In accordance with the present 
invention. A biological sample to be screened can be a biological fluid such as 
extracellular or intracellular fluid or a cell or tissue extract or homogenate. A 
biological sample can also be an isolated cell (e.g., in culture) or a collection 
of cells such as in a tissue sample or histology sample. A tissue sample can 
be suspended in a liquid medium or fixed onto a solid support such as a 
microscope slide. 
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In accordance with a screening assay method, a biological sample is 
exposed to an antibody immunoreactive with the polypeptide whose presence 
is being assayed. Typically, exposure is accomplished by fonming an 
admixture in a liquid medium that contains both the antibody and the candidate 
5 polypeptide. Either the antibody or the sample with the polypeptide can be 
affixed to a solid support (e.g., a column or a microtiter plate). 

The biological sample is exposed to the antibody under biological 
reaction conditions and for a period of time sufficient for antibody-polypeptide 
conjugate formation. Biological reaction conditions include ionic composition 

10 and concentration, temperature, pH and the like. 

Ionic composition and concentration can range from that of distilled 
water to a 2 molal solution of NaCI. Preferably, osmolality is from about 100 
mosmols/l to about 400 mosmols/l and, more preferably from about 200 
mosmols/l to about 300 mosmols/l. Temperature preferably is from about4°C. 

15 to about lOO^'C, more preferably from about 15°C. to about 50^*0. and, even 
more preferably from about 25^C to about 40''C. pH is preferably from about 
a value of 4.0 to a value of about 9.0, more preferably from about a value of 6.5 
to a value of about 8.5 and, even more preferably from about a value of 7.0 to 
a value of about 7.5. The only limit on biological reaction conditions is that the 

20 conditions selected allowforantibody-polypeptide conjugate fomiation and that 
the conditions do not adversely affect either the antibody or the polypeptide. 

Exposure time will vary inter alia with the biological conditions used, the 
concentration of antibody and polypeptide and the nature of the sample (e.g., 
fluid or tissue sample). Means for determining exposure time are well known 

25 to one of ordinary skill in tiie art. Typically, where the sample is fluid and the 
rancentration of polypeptide in that sample is about lO'^^M, exposure time is 
from about 10 minutes to about 200 minutes. 

The presence of polypeptide in the sample is detected by detecting the 
formation and presence of antibody-polypeptide conjugates. Means for 

30 detecting such antibody-antigen (e.g., receptor polypeptide) conjugates or 
complexes are well known in the art and include such procedures as 
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centrifugation, affinity chromatography and the like, binding of a secondary 
antibody to the antibody-candidate receptor complex. 

In one embodiment, detection is accomplished by detecting an indicator 
affixed to the antibody. Exemplary and well known such indicators include 
5 radioactive labels (e.g., ^P, ^^1 , ^^C), a second antibody or an enzyme such as 
horse radish peroxidase. Means for affixing indicators to antibodies are well 
known in the art. Commercial kits are available. 

Screening Assav for Anti-Polypeptide Antibodv 
In another aspect, the present invention provides a method of screening 
10 a biological sample for the presence of antibodies immunoreactive with a piwi 
family polypeptide. Preferably the piwi family polypeptide has a biological 
activity in accordance with the present invention. In accordance with such a 
method, a biological sample is exposed to a piwi family polypeptide under 
biological conditions and for a period of time sufficient for antibody-polypeptide 
1 5 conjugate formation and the fomned conjugates are detected. 

Screening Assav for Polynucleotide That Encodes a 
p/vw family Polvpeotide of the Present Invention 
A DNA molecule and, particularly a probe molecule, can be used for 
hybridizing as an oligonucleotide probe to a DNA source suspected of 
20 encoding a p/w/ family polypeptide of the present invention. Preferably the piwi 
family polypeptide possesses a biological activity in accordance with fhe 
present invention. The probing is usually accomplished by hybridizing the 
oligonucleotide to a DNA source suspected of possessing a p/W family gene. 
In some cases, the probes constitute only a single probe, and in others, the 
25 probes constitute a collection of probes based on a certain amino acid 
sequence or sequences of the polypeptide and account in their diversity forthe 
redundancy inherent in the genetic code. 

A suitable source of DNA for probing in this manner is capable of 
expressing a polypeptide of the present invention and can be a genomic library 
30 of a cell line of interest. Alternatively, a source of DNA can include total DNA 
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from the cell line of interest Once the hybridization method of the invention 
has identified a candidate DNA segment, one confinms that a positive clone has 
been obtained by further hybridization, restriction enzyme mapping, 
sequencing and/or expression and testing. 
5 Aitematively, such DNA molecules can be used in a number of 

techniques including their use as: (1) diagnostic tools to detect nomnal and 
abnomial DNA sequences in DNA derived from patienfs cells; (2) reagents for 
detecting and Isolating other members of the polypeptide family and related 
polypeptides from a DNA library potentially containing such sequences; (3) 

10 primers for hybridizing to related sequences for the purpose of amplifying those 
sequences; (4) primers for altering native p/Vw family DNA sequences; as well 
as other techniques which rely on the similarity of the DNA sequences to those 
of the DNA segments herein disclosed. 

As set forth above, in certain aspects, DNA sequence information 

1 5 provided by the invention allows for the preparation of relatively short DNA (or 
RNA) sequences (e.g., probes) that specifically hybridize to encoding 
sequences of a selected piwi family gene. In these aspects, nucleic acid 
probes of an appropriate length are prepared based on a consideration of the 
encoding sequence for a polypeptide of this invention. The ability of such 

20 nucleic acid probes to specifically hybridize to other encoding sequences lend 
them particular utility in a variety of embodiments. Most importantly, the probes 
can be used in a variety of assays for detecting the presence of complementary 
sequences in a given sample. However, other uses are envisioned, including 
the use of the sequence information for the preparation of mutant species 

25 primers, or primers for use in preparing other genetic constructions. 

To provide certain of the advantages in accordance with the invention, 
a prefenred nucleic acid sequence employed for hybridization studies or assays 
includes probe sequences that are complementary to at least a 14 to 40 or so 
long nucleotide stretch of a nucleic acid sequence of the present invention, 

30 such as a sequence shown in any of SEQ ID N0s:1 , 3 and 5. A size of at least 
14 nucleotides in length helps to ensure that the fragment is of sufficient length 
to form a duplex molecule that Is both stable and selective. Molecules having 
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complementary sequences overstretches greater than 14 bases in length are 
generally preferred, though, to Increase stability and selectivity of the hybrid, 
and thereby improve the quality and degree of specific hybrid molecules 
obtained. One will generally prefer to design nucleic acid molecules having 
5 gene-complementary stretches of 1 4 to 20 nucleotides, or even longer where 
desired. Such Iragments can be readily prepared by, for example, directly 
synthesizing the fragment by chemical means, by application of nucleic acid 
reproduction technology, such as the PGR technology of U.S. Pat. No. 
4.683,202, herein incorporated by reference, or by introducing selected 

1 0 sequences into recombinant vectors for recombinant production. 

Accordingly, a nucleotide sequence of the present invention can be used 
for its ability to selectively form duplex molecules with complementary stretches 
of the gene. Depending on the application envisioned, one employs varying 
conditions of hybridization to achieve varying degrees of selectivity of the probe 

15 toward the target sequence. For applications requiring a high degree of 
selectivity, one typically employs relatively stringent conditions to form the 
hybrids. For example, one selects relatively low salt and/or high temperature 
conditions, such as provided by 0.02M-0.15M NaCI at temperatures of SO'^C 
to ZO'^C. Such conditions are particulariy selective, and tolerate little, if any, 

20 mismatch between the probe and the template or target stnand. 

Of course, for some applications, for example, where one desires to 
prepare mutants employing a mutant primer strand hybridized to an underiying 
template or where one seeks to isolate polypeptide coding sequences from 
related species, functional equivalents, or the like, less stringent hybridization 

25 conditions are typically needed to allow formation of the heteroduplex. Under 
sudi circumstances, one employs conditions such as 0.15M-0.9M salt, at 
temperatures ranging fifom 20''C to SS^'C. Cross4iybridizing species can 
thereby be readily identified as positively hybridizing signals with respect to 
control hybridizations. In any case, it is generally appreciated that conditions 

30 can be rendered more stringent by the addition of increasing amounts of 
formamide, which serves to destabilize the hybrid duplex in the same manner 
as increased temperature. Thus, hybridization conditions can be readily 
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manipulated, and thus will generally be a method of choice depending on the 
desired results. 

In certain embodiments, it is advantageous to employ a nucleic acid 
sequence of the present invention in combination with an appropriate means, 
5 such as a label, for detemriining hybridization. A wide variety of appropriate 
indicator means are known in the art, including radioactive, enzymatic or other 
llgands, such as avidin/blotin, which are capable of giving a detectable signal 
In preferred embodiments, one likely employs an enzyme tag such a urease, 
alkaline phosphatase or peroxidase, instead of radioactive or other 

1 0 environmentally undesirable reagents. In the case of enzyme tags, calorimetric 
indicator substrates are known which can be employed to provide a means 
visible to the human eye or spectrophotometrically, to identify specific 
hybridization with complementary nucleic acid-containing samples. 

In general, it is envisioned thatthe hybridization probes described herein 

1 5 are useful both as reagents in solution hybridization as well as in embodiments 
employing a solid phase. In embodiments involving a solid phase, the sample 
containing test DNA (or RNA) is adsorbed or otherwise affixed to a selected 
matrix or surface. This fixed, single-stranded nucleic acid is then subjected to 
specific hybridization with selected probes under desired conditions. The 

20 selected conditions depend inter alia on the particular circumstances based on 
the particular criteria required (depending, for example, on the G+ C contents, 
type of target nucleic acW, source of nucleic acid, size of hybridization probe, 
etc.). Following washing of the hybridized surface so as to remove 
nonspeciflcally bound probe molecules, specific hybridization is detected, or 

25 even quantified, by means of the label. 

Assay Kits 

In another aspect, the present invention contemplates assay kits for 
detecting the presence of a polypeptide of the present invention in biological 
samples, where the kits comprise a first container containing a first antibody 
30 capable of immunoreacBng with the polypeptide, with the first antibody present 
in an amount sufficient to perfomn at least one assay. Preferably, the assay kits 
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of the invention further comprise a second container containing a second 
antibody that immunoreacts with the first antilx>dy. More preferably, the 
antibodies used in the assay k\te of the present invention are monoclonal 
antibodies. Even more preferably, the first antibody is affixed to a solid 
5 support. More preferably still, the first and second antibodies comprise an 
indicator, and, preferably, the indicator is a radioactive label or an enzyme. 

The present invention also contemplates a kit for screening agents. 
Such a kit can contain a polypeptide of the present invention. The kit can 
contain reagents for detecting an interaction between an agent and a receptor 

10 of the present invention. The provided reagent can be radiolabeled. The kit 
can contain a known radiolabelled agent capable of binding or Interacting with 
a receptor of the present invention. 

In an alternative aspect, the present invention provides assay kits for 
detecting the presence, in biological samples, of a polynucleotide that encodes 

15 a polypeptide of the present invention, the kits comprising a first container that 
contains a second polynucleotide identical or complementary to a segment of 
at least 10 contiguous nucleotide bases of, as a prefened example, SEQ ID 
NOs:1,3and5. 

In another embodiment, the present invention contemplates assay kits 
20 for detecting the presence, in a biological sample, of antibodies 
immunoreactive with a polypeptide of the present invention, the kits comprising 
a first container containing a piwi family polypeptide, that immunoreacts with 
the antibodies, with the polypeptide present in an amount sufRcientto perform 
at least one assay. Preferably, the p/wi femily polypeptide has a biological 
25 activity in accordance with the present invention. The reagents of the kit can 
be provided as a liquid solutbn, attached to a solid support or as a dried 
powder. Preferably, when the reagent is provided in a liquid solution, the liquid 
solution is an aqueous solution. Preferably, when the reagent provided is 
attached to a solid support, the solid support can be chromatograph media or 
30 a microscope slide. When the reagent provided is a dry powder, the powder 
can be reconstituted by the addition of a suitable solvent. The solvent can be 
provided. 
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EXAMPLES 

Thefollowing examples have been included to illustrate prefened modes 
of the invention. Certain aspects of the following examples are described in 
terms of techniques and procedures found or contemplated by the present 
5 inventors to work well in the practice of the invention. These examples are 
exemplified through the use of standard laboratory practices of the inventors. 
In light of the present disclosure and the general level of skill In the art, those 
of skill will appreciate that the following examples are Intended to be exemplary 
only and that numerous changes, modifications and alterations can be 

10 employed without departing from the spirit and scope of the invention. 

In exploring stem cell renewal mechanisms in the germline, the piwi 
gene is disclosed herein to be essential for GSC maintenance in Drosophila, 
The role of the piw'i gene is characterized herein by recessive mutations that 
cause failure In GSC maintenance in both females and males, piwi mutant 

15 gonads contain a nomial number of GSCs at the onset of gametogenesis. 
However, mutant adult gonads are devoid of GSCs and contain only a very 
small number of gametes that is approximately equal to or less than the wild- 
type number of GSCs. These obsen^ations reveal that piwi is essential for 
GSC maintenance in both males and females. 

20 Summarv of Observattons in Examoles 

The purification and isolation of cDNAs representing p/iv/ family genes 
from human, mouse, Drosophilia are disclosed in the Examples, along with the 
primary structure of the PIWI proteins in human, mouse and Drosophila as 
deduced from the cDNA sequences. Additionally, the essential function of p/iv; 

25 in Drosophilia genmline stem cell growth, proliferation and selfHPenewal and in 
embryonic development is also disclosed. 

Further, the identification of piwi homologous genes, prg-l and prg-Z in 
C. elegans and the demonstnEition of the important function of prg-l and prg-ll 
in C. elegans gemnline self-renewal similar to that of piwi in Drosophilia is 

30 described in the Examples. The identification of two pAv/ family-like genes in 
Arabidopsis, Zwille and argonaute(ago), that are known to be required for 
meristem cell divisions Is also described. 
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The growth, proliferation and self-renewing division modulating biological 
activity of human and murine piwi genes (named hiwi and miwi, respectively) 
in germline stem cell renewal, proliferation of primordial germ cells and in other 
developmental processes is characterized in the Examples based on the 
5 following data and reasoning:(i) hiwi and miwi are more homologous to piwi 
than prg-l, prg-U, Zwille and ago to piwi; (ii) human and murine gemnline stem 
cells are much more similar to Drosophilia gemnline cells than to the C. elegans 
gemnline or Arabidopsis meristem cells; (iii) the function of piwi family genes 
are well conserved among Drosophilia, C- elegans, and Arabidopsis, which are 

10 evolutionarily more diverse than between Drosophilia and human/mammals; 
and (iv) hiwi and miwi is indeed specifically expressed in the adult human and 
murine testis, respectively, but not in the adult human of murine ovary which 
does not contain gemiline stem cells, miwi is further shown to be expressed 
specifically in the gemnltne stem ceil region in the mouse testis. Therefore, 

15 hiwi, miwi, and other mammalian piwi family genes are shown to play an 
important roie in germline stem cell division and other stem cell-related 
processes. 

The specific expression pattern of hiwi and miwi in the adult human and 
murine testis, respectively, but not in the adult human or murine ovary which 

20 do not contain genmline stem cells is also demonstrated in the Examples, miwi 
is further shown to be expressed in the gemniine stem cell region in the mouse 
testis. The discovery of a highly conserved pnDtein signature motif of 43-amino 
acid residues, the PIWI box, that exists in the p/Vkf family genes as well as in a 
broader dass of novel genes which may be structurally and functionally defined 

25 by the PIWI box is also described in the Examples. The demonstrntion that the 
PIWI protein and, thus, the PRG-1, PRG-2, MIWI. HIWI. Zwille, and AGO 
proteins are nuclear proteins is also disclosed. 

Example 1 
Isolation and Purification of oiwi 

30 The cloning and characterization of the Drosophila piwi gene is 

disclosed in this Example. Germline stem ceils (GSCs) serve as the source for 
gametogenesis in diverse organisms. The piwi gene product is required forthe 
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asymmetric division of GSC to produce and maintain a daughter GSC but is not 
essential for the further differentiation of the committed daughter cell. Genetic 
mosaic and RNA in situ analyses indicate that piwi expression in adjacent 
somatic cells regulates GSC division. 
5 piwi encodes a highly basic novel protein well conserved during 

evolution. This Example also describes the isolation of p/w/ family homologs 
in C. elegans and human. This Example further describes the identification of 
Ambidopsis piwi family-like genes known to be required for meristem cell 
maintenance. Decreasing C. elegans p/vw expression reduces the proliferation 

10 of GSC-equivalent cells. Thus, piwi represents a novel class of genes required 
for GSC division In diverse organisms. 

Drosoptiila has been used in the art as a model for studying 
mechanisms that are conserved among diverse developmental systems. As 
show herein, this is also the case for the study of stem cells. In Drosophila, 

15 stem cells exist in the germline at the apical tip of the ovariole, the functional 
unit of the ovary. In the ovariole. gennline stem cells (GSCs) are located in a 
specialized structure called the germarium, as first proposed by Brown and 
King (1 962, 1 964). The existence of GSCs was strongly supported by germline 
clonal analysis (SchQpbach et a!., 1978; Wieschaus and Szabad, 1979) and 

20 directly verified by laser ablation (Lin and Spradling, 1 993). In each germarium, 
2-3 GSCs contact the somatic t>asal terminal filament cells. GSCs undergo 
oriented asymmetric divisions to produce a daughter stem cell which remains 
associated with the tenminal filament and a differentiated daughter, the 
cystoblast, that becomes displaced one cell away from the terminal filament 

25 (Lin and Spradling, 1997; Deng and Lin, 1997). The gennline cyst then 
becomes enveloped by follicle cells produced by somatic stem cells (Margolis 
and Spradling, 1995) to fonn an egg chamber, which buds off the gemnarium, 
joins pre-existing egg chambers in a linear anray to form an ovariole, and 
eventually develops into a mature egg (reviewed in King, 1970; Spradling, 

30 1993). This assembly-line organization, with each egg chamber representing 
a differentiated stem cell product whose position along the ovariole 
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corresponds to its birth order, provides a unique opportunity to study stem cell 
division. 

To investigate the nature of the p/m-mediated mechanism, this Example 
reports that piwi mediates a somatic signaling mechanism essential for the 
5 division and maintenance of GSCs in Dmsophila. Moreover, piwi encodes a 
novel basic protein well conserved in C. elegans and humans, and also 
conserved in Ambidopsis, where two p/vw family-like genes are known to be 
required for meristem cell maintenance. It is also demonstrated that the C. 
elegans piwi family homolog is also required for germline proliferation and 

10 maintenance. Thus, piwi family genes represent essential stem cell genes 
existing in diverse organisms. 

DrosoDliila strains and culture . All strains were grown at 25°C on yeast- 
containing molasses/agar medium. The p/w/^ piwi^ and p/wf^ mutations are 
single PZ insertional mutations reported by Lin and Spradling (1997) while the 

15 p/iv/2 mutation is a single P-/y^^ transposable insertion. Oregon R (Ore-R) 
serves as the wildtype strain for all experiments. 

Phenotvpic analysis of p/'w/ mutants . Piwi mutationswere balanced over 
CyO-y^ and introduced into the y/y genetic background. The homozygous and 
heterozygous larvae were separated at the late third instar stage by scoring 

20 mouth hooks and denticle belts for y. They were then dissected to obtain the 
ovary immediately or were aged at 25 ""C to desired pupal stages before 
dissection. Dissected ovaries were analyzed by immunofluorescence 
microscopy for defects in ovary differentiation and GSC division. 

Immunocvtochemistry and immunofluorescence microscopv . Wildtype 

25 and mutant ovaries from larval, pupal, or adult stages were dissected, fixed and 
stained as described by Lin et aL (1994). For immunofluorescence staining, 
antirVASA antibodies (Hay et aL, 1990) were used to specifically mark germ 
cells at 1:2000 dilution. Anti-a-spectrin antibodies (Byers et al., 1987) were 
used to outiine somatic cells and to mark spectrosomes and fusomes (Lin and 

30 Spradling, 1 995) at 1 :200 dilution. The monoclonal anti-MYC epitope antibody 
1-9E1 0.2 was described by Evan et al. (1 985) and was used at 1 :2 dilution. All 
the fluorescence-conjugated secondary antibodies were finom Jackson 
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ImmunoResearch Laboratory Inc., Westgrove, Pennsylvania, and were used 
at 1:200 dilution. 

Immunofluorescently labeled samples were also counter-stained with the 
DNA specific dye DAPI as described In Lin and Spradling (1993). The 

5 immunologically labeled samples were examined using Nomarski and 
epifluorescence microscopy under a Zeiss AXIOPLAN® microscope (Zeiss, 
Oberkochen, Gemnany) equipped with a STAR-1™ cooled CCD camera 
(Photometries, Tucson, Arizona). Selected samples were further analyzed by 
confocal microscopy using a Zeiss LSM410™ confocal microscope mounted 

10 on an AXIOVERT® 1 00. Images from the Zeiss AXIOPLAN® were collected 
using IP Lab software and confocal images were processed using the LSM41 0 
software. All images were processed using the Adobe PhotoshopTM program 
(Adobe Systems, Inc., San Jose, Califomia). 

Cloning of p/w/and mappino of its mutations , p/w/ maps to the left arm 

15 of the second chromosome within polytene band 32C, as defined by insertions 
of single transposable elements (Lin and Spradling, 1997). DNA fragments 
flanking the piwi^, piwi^, and piwi^ insertions were recovered by a plasmid 
rescue method (Wilson et aL, 1989). Genomic DNA In the rescued plasmid 
was used to isolate additional genomic clones from a X genomic DNA library 

20 (Maniatis et al., 1978) to constitute a 36 kb region of overiapping walk clones 
(Figure 2). The insertion sites forp/w/^, p/w/3, and p/vw^ were first mapped by 
Southem blots of EcoRI digested walk clones, and tiien precisely detenmined 
by DNA sequencing. The site of the p/w/^ insertion was detenmined by 
sequencing its flanking DNA fragments isolated by inverse PCR of p/w/2 

25 genomic DNA using primers specific to the P-eiement tenmini (Ochman et al. , 
1990). 

Genomic DNA fragments derived from the pAv/ locus were used to probe 
Northem blote of poly (A)+ RNA from wild-type male and female flies. Three 
non-overiapping transcripts, a (1.1 kb), p (0.64 kb), and c12 (3.1 kb) were 
30 identified in the region and only c12 was intenupted by p/vw mutations. 9.36 
kb of genomic DNA that encompasses the c12 transcription unit was 
sequenced either using a Sequenase kit (U.S. Biochemical) or an ABI 377 
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Prism DMA Sequencer (Perkin-Elmer, Applied Biosystems Inc). All sequence 
data was analyzed using the DNASTAR software package (DNASTAR, 
Madison, Wl). Two additional transcripts. LD08487 (1.43 kb) and LD13681, 
were identified on the right side of the P-insertfons by BLAST analysis 
(Altschul. et al„ 1990) of genomic sequence. 

Two overiapping cDNA clones which together represent the 3.1 kb full 
length c12 mRNA and the LD08487 cDNA were sequenced. Comparison 
between the genomic and the cDNA sequences precisely determined the 
intron-exon organizatton of these two transcription units. The LD13681 
transcription unitwerealso mapped by partial sequencing. Standard molecular 
btology techniques were earned out according to Sambrook et al. (1989). 

Transqenip rescue. The 6.8 kb HindlllJEcoRV {pRc12; piwi) and 3.9 kb 
PstI (pRLD; LD08487) genomic fragments were subcloned Into the P element 
transfbmnation vector pCaSpeR4 (Pirrotta. 1988). Transgenic flies were 
produced according to Spradling and Rubin (1982), using w; A2-3 Sb efTMS 
e (Robertson et al., 1988) embryos as recipients. Transgenes were then 
seperated fi-om the A2-3 transposase and Introduced into the homozygous 
piwi'f background for fertility tests by genetic crosses. Six independent pRc12 
Inserts and eight independent pRLD inserts were recovered. Rescue crosses 
were canied out at 25°C. 

Northern blot analysis a nd o/ivi cDNA isolation. Poly (A)+ mRNA 
Isolation, Northern blot preparation, and 32p.jabeled probe preparation by 
random-hexamer extensfon were according to Sambrook et al. (1 989). For Fig. 
28, the probe was madefroma 5.4 kbH/nof///genomicfragment encompassing 
the pnv/ transcription unit The same probe was used to isolate a 2.1 kb partial 
cDNA from the pNB40 third instar library of Brown and Kalatos. 1989. Using 
the sites within the polylinker (Hindlll at 5' end and NotI at 3' end), the 2.1 kb 
cDNA was subcloned into pBlueScript KS(+) (Stratagene), resulting in a 
construct denoted pDCI. A primer. pDCI 5'REV, 5'- 
ACGATAAGTTCTGTTAT-3' (SEQ ID NO: 1 1 ) was designed downstream of the 
5' end of pDCI and was used in combination with library specific primers to 
screen the Tulle Hazelrigg ovarian cDNA library (Friedman et al.. 1990). A 
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partially overtapping 500 bp PCR product was Isolated, cloned into the pGEM-T 
vector (Piomega, Madison, Wisconsin), and used to probe the same library. 
An overlapping 1 kb cDNA clone was isolated and subcloned into the Sail site 
of pDCI to produce a full length 3.1 kb p/w/ cDNA constaict denoted pDC2. 
5 Characterization of pw/homoloas from C, eleaans and Homo sapiens . 

The BLAST algorithm (Altschul et al., 1990) was used to search the non- 
redundant databases at the National Center for Biotechnology Information 
(NCBI)atthe National Library of Medicine. The Prosite Protein Motif Database 
(DNAStar, Madison, Wl) was also searched to identify domains shared 

1 0 between piwi and other known proteins. The PIWI ORF was used to query the 
SWISS-Prot database by Bl-AST for homologous proteins. Significantly, the 
search identified two C. elegans predicted proteins of unknown function. 
Expressed sequence tags (ESTs) corresponding to these two gene products 
were then identified as D2030.6 (p/gf-1 ; GenBank accession No. Z73906) and 

15 C01G5.2 {prg-l; GenBank accession NoU50068) from GenBank. Dr. Yuji 
Kohara (National Institute of Genetics, Japan) kindly provided X ZAPII 
phagemid clones corresponding to prg-^ and p/p-2. 

The p/iv/cDNA was further used to query the NCBI dbEST database and 
identified an EST, zw68hG1 .r1 , isolated from a human testes cDNA library, 

20 GenBank accession number AA430311. This 0.9 kb EST clone was 
sequenced and used to screen an oligo-(dT) primed human testis cDNA library 
(Clontech, Palo Alto, California). A 2.3 kb cDNA clone conBsponding to hiwi 
was isolated and sequenced. 

Whole-mount tissue In situ hybridization . Ovarian whole-mount tissue 

25 in sHu hybridization was perfbnmed as described in Yue and Spradling (1992) 
with DIG-iabeled (Genius Kit, Boerhinger Mannheim) full length p/vWcDNAfrom 
pDC2 as a probe. For detecting RNA in surface ceils, ovaries were digested 
5 minutes with 50 ^g/ml proteinase K. Under this condition, only temiinal 
filament cells and anterior sheath cells, but not any other surface cells, were 

30 stained with the piwi cDNA probe. The pBST control probe did not show any 
significant signal. For detecting RNA in inner geanariai cells, the digestion was 
7-10 minutes. Under this condition, the piwi probe revealed a hybridization 
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pattern as shown in Results, the pBST control probe did not show any 
significant signal, while the positive control oskar probe showed specific 
staining in presumptive oocytes (Ephrussi et al., 1991). Staging of egg 
chambers is according to King (1970). 
5 Genetic clonal analyses and qemriarial transplantation , Thep/w/^and 

piwi^ gennline clones were generated with the FLP-DFS technique as 
described in Chou and Penimon (1996). y, w PJhsFLPp^; P[ov€p1]2L 
FRT^OA/cyO males were crossed to w; piwi^, FRT^OA/cyO virgin females to 
produce y,w PlhsFLPp^; P[ovoD1j2L FRT^^/piwi^, FRT^OA progeny, 

1 0 Identical crosses were performed with w; piwi^, FRT^OA/cyO virgin females. 
The genotype of the flies were verified both by Southern analysis for the 
presence of piwi P insertion and the FRT as well as by sterility tests by 
backcrossing to the original piwi allele. Crosses were canied out for two days 
to produce transheterozygous progeny. After 2 days, adults were transferred 

15 to fresh vials. Larvae were heat shocked on days 3 and 4 for one hour in a 
37*^0 water bath to induce mitotic recombination. The heat-shocked females 
with gennline clones were crossed to Ore-R males for fertility tests and their 
ovarian phenotype examined by immunofluorescence microscopy. 

To generate somatic clones of piwi^ and p/w/2 the above described 

20 p/iv/^ FRT^OA/cyO and p/iv/2 FRT^^^/CyO strains were crossed to w; 
ppisFLPfif^; PlW^i hs-NM], FRT^OA/cyO males as described in Xu and Rubin 
(1993). Females of the genotype w, P[hsFLPfiF;P[W^:hs-NMl FRT^OA/p j^i, 
FRT^OA vvere crossed to Ore-R males for fertility tests. Clonal induction was 
carried out as described above except that females of the appropriate genotype 

25 were subjected to an additional 20-minute heat shock at ST^'C 90 minutes prior 
to dissection to induce the expression of the myc tag. Germarial 
transplantation was earned out as described by Lin and Spradling (1993). 

RNA-mediated interference (RNAi) . Templates for in vitro transcription 
were cloned Into pBluescript KS (+) and transcribed using either T7 or T3 RNA 

30 polymerases according to the manufacturer's protocol (Megascript T7 and T3 
kits, Amblon). prg-1 cDNA served as the experimental RNA for injection while 
pBluescript KS(-»') RNA was used as a negative control. RNA integrity was 
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determined by gel electrophoresis; concentration was determined by a 
combination of UV spectrophotometry and ethidium bromide staining. 
Injections used an equimolar mixture of uncapped sense and antisense RNA 
at a concentration of 10 mg/ml. Gonadal injections of wild type N2 Bristol 
5 hermaphrodites was done as described in Melio and Fire (1 995). 

After recovery and transfer to standard solid media, Injected animals 
were transferred to fresh culture plates at 24 hours intervals to facilitate the 
identification of phenotypic differences. Germlines of progeny produced 
between 6 and 48 hours after injection were examined for gamete 

10 differentiation either by DAPI staining or fertility. For DAPI analysis, adult 
hermaphrodites displaying RNAi induced phenotypes were fixed overnight in 
Carnoy's solution (60% ethanol; 30% acetic acid; 10% chlorofomi; Sulston and 
Hodgkin, 1988). The mitotic index is defined as the ratio between the number 
of prophase, metaphase, anaphase, and telophase nuclei and the total number 

15 of nuclei in ihe MPZ. 

Piwi mutations eliminate the growth, proliferation and self-renewing 
division of germline stem cells , p/iv/^ mutant ovaries contain a normal number 
of mispositioned GSCs at the onset of oogenesis at the late third instar larval 
stage, which, however, leads to an equal or somewhat smaller number of 

20 gametes in the adult gonads that no longer contain GSCs (Lin and Spradling, 
1997). This failure of gennline maintenance could be due to: (1) the 
differentiation of GSCs without growth, proliferation and self-renewing divisions; 
(2) a defect in the asymmetry of GSC division, producing aberrant germ cells 
that eventually degenerate; and/or (3) a secondary defect influenced by 

25 abnormal ovary differentiation. 

To examine whether the failure of germline maintenance is indeed a 
secondary defect due to abnomial ovary development, the ovarian morphology 
of piwfi and p/wP mutants which also fail to maintain GSCs was examined. 
The examination relied on Nomarsld optics as well as markers that specifically 

30 identify gemi cells, somatic cells, and spectrosomes/fusomes, gennline specific 
organelles that indicate individual stages of genm cell development (Lin et al., 
1994; Lin and Spradling, 1995). These mutant ovaries show normal 
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morphology at the third instar larval stage. Their germline cells are normal in 
number and are conrectly positioned along the medial plane of the ovary. 
Moreover, the expected number of terminal filaments are forming, so that at the 
pupal stage, the ovary differentiates nonnally, partitioning GSCs and their 
5 daughter cells comecUy into individual germaria and ovarioles. GSCs are able 
to divide several times to provide a normal complement of germ cells to the 
gemiartum. Yet, GSCs subsequently fall to continue self-renewing divisions, 
and the existing gennline cysts often degenerate during the late pupal stage 
so that the adult ovarioles contain gennlineless germaria and fewer egg 
1 0 chambers than expected. These observations indicate that the failure of GSC 
maintenance in piwi mutants is not a secondary defect due to abnormal ovary 
development. 

To examine whether the failure of GSC maintenance is due to 
differentiation without self-renewing divisions or due to aberrant divisions 

15 followed by degeneration, the ovarian development of the piwi^ mutant was 
examined more closely. The piwi^ third instar larval ovaries contain a normal 
number of 50-70 presumed GSCs that are mis-positioned. However, these 
GSCs still seem to, and indeed must have, partitioned during subsequent pupal 
development, since the number of ovarioles formed in most adult ovaries is 

20 within the normal range (13+ 3 in mutant vs. 15+3 in wildtype). Hence, the 
effect of p/w/^ on GSC mis-positioning is not essential for oogenesis. 

The main oogenic defect of the piwi^ mutant is the differentiation of 
GSCs without self-renewing divisions immediately following the initiation of 
oogenesis. At this stage in wildtype ovaries, GSCs in 16-23 newly fomied 

25 germaria have initiated asymmetric divisions to generate multiple developing 
germline c^sts. However, in mutant ovaries, GSCs and their immediate 
differentiated daughters, cystoblasts, are undetectable, as indicated by the 
absence of spectrosome-containing germ cells. Instead, most ovaries contain 
differentiated germline cysts whose number approximately equals that of 

30 GSCs. These cysts are much larger in size and contain 2- to multi-cell stage 
fusomes, indicating their differentiating state. By the adult stage, most 
ovarioles contain only two nonmal or abnonmal egg chambers derived from 
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these cysts, but no other germline ceils. This defect contrasts the development 
of wiidtype ovaries, in which ovarioles contain a fully developed gennarium and 
a stage 1 egg chamber by the 48-hour pupal stage, and have produced 
multiple egg chambers by adult stage. This observation indicates that the 
5 mutant GSCs have differentiated Into germline cysts without self-renewing 
divisions. The abnormal fusome morphology often seen in cysts suggest that 
they contain more or less than the nonnal number of 16 cells, which indeed 
was seen in adult egg chambers. 

Thus, piwi is required for the self-renewing division of GSCs during 

10 oogenesis in Drosophila. Cohfocal images showing third Instar larval ovaries 
from wild-type, p/w/2, and piwi^ mutant flies, respectively, were prepared and 
stained with a gemnline specific marker VASA in green and spectrin in red to 
mark somatic cells and spectrosomes/fusomes. GSCs were positioned 
medially along the anterioposterior axis in the wild-type and piwi^ ovary; 

15 however, GSCs are mispositioned in the piwi^ mutant ovary. Spectrosomes 
(Sp) appear as red dots in germ cells, A 50pm measurement bar was used in 
the analysis of the confocal images. 

Confocal images of 24-hour pupal ovaries from wild-type and piwi^ 
mutantflies, respectively, were prepared, with wild type stained solely for VASA 

20 while piwi^ mutant was stained for both VASA in green and spectrin in red. At 
this stage, the wild-type ovary has partitioned into individual gemnaria (Ge), 
which contain GSCs and early germline cysts. In piwi^ mutant ovaries germ 
cells still appear to have been partitioned, although the partition is distorted by 
the highly differentiated large genmline c^sts containing elaborate fiisomes 

25 (Fu). The number of cysts did not exceed the number of GSCs. 

Confocal images of 48-hour pupal ovaries from wild-type, p/iv/2 and 
piwil mutant flies, respectively, were prepared and stained with VASA in green 
and spectrin in red. By this stage, in both wild-type and p/w/2 mutant ovaries, 
gemnaria have fully developed and stage 1 egg chambers (S1) have budded 

30 off the germarium in synchrony due to continued GSC division. However, in 
p/w/^ mutant ovaries, the number of gemnline cysts remained unchanged, 
despite their continued differentiation. 
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A confocal image of a 72-hour pupal ovary from piwi^ mutant flies was 
prepared and stained with VASA in green and spectrin stained in red. At this 
stage, GSCs fail to self-renew and some existing genmline cysts start to 
degenerate. DAPI images of 0-1 day old adult ovarioles from wild-type, piw^ 
5 and piwi^ mutant flies, respectively, were also prepared. Both piwi^ (L) and 
piwi^ (H) mutant ovarioles typically contain two nonmal or abnormal egg 
chambers connected to a germlineless gennarta (Ge). In contrast, wild-type 
ovarioles contain a long string of developing egg chambers produced 
continuously by the gennarium. piwi^ phenotype is indistinguishable from that 
10 ofp/w/2 

Molecular cloning of Drosophila piwL To study the molecular nature of 
the p/V/-mediated mechanism, a 15 kb genomic region flanking the piwi 
insertions was cloned. Candidate transcripts in the region were identified, and 
a 6.8 kb genomic fragment (pRc12) containing a 3.1 kb transcript intenrupted 
15 by piwi mutations (transcript c12) was able to rescue the piwi phenotype 
completely (Fig. 1). An overiapping 3.9 kb genomic fragment (pRLD) 
containing a 1 .43 kb transcript immediately adjacent to the piwi mutations fails 
to rescue the piwi phenotype (Fig. 1 ). Therefore, the c12 transcript is the piwi 
mRNA. 

20 Drosophila PIWI represents a novel class of conserved proteins . To 

study the structure of the piwi gene, two overiapping cDNA clones which 
together represent the 3.1 kb full length c12 mRNA were isolated and 
sequenced. The c12 cDNA sequence contains a large open reading frame 
(ORF) starting at 84 bp and encoding a protein of 843 amino acid residues 

25 with a predicted molecular weight of 97.2 kD (SEQ ID N0s:1 and 2). Multiple 
stop codons in all three frames are present upstream of the putative start 
codon. Downstream of the ORF is a 432 bp 3' untranslated region (UTR) 
followed by a 55-bp poly(A) tail not encoded in the genomic sequence. Thus, 
piwi RNA is present in piwiVCyO adult males and females. Northem blots 

30 including female and male lanes containing approximately 10 and 1.5 mg of 
polyA*** RNA, respectively, were prepared. They were exposed for 3.5 and 12 
hours, respectively, and a band was observed at 3.1 kb in each lane. 
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The PI Wl protein is a highly basic (pi 9.6) novel protein with no obvious 
similarity to other known proteins or functional motifs in the databases. It is 
characterized by alternating basic and acidic regions and is particularly basic 
over the C-tenninal 100 amino acid residues. Hydropathy analysis indicates 
5 that the PIWI protein contains no significant local hydrophobic regions that 
could be potential signal peptide or transmembrane domains. PSORT analysis 
(Nakai and Kanehisa, 1992) predicts nuclear localization of the PIWI protein. 
The protein has 21 conserved protein kinase C phosphorylation sites, 14 
casein kinase 2 phosphorylation sites, and 4 tyrosine phosphorylation sites, 

10 indicating its potential as a phosphorylation target 

To detemiine whether the PIWI protein is consen/ed during evolution, 
applicant searched for its homologous sequences at the protein level and 
identified two ORFs of unknown function from C. elegans and an expressed 
sequence tag (EST, AA43031 1) from a human testis cDNA library. Applicant 

15 isolated and sequenced cDNAs for the two C. elegans genes, herein named 
prg-1 and prg-2 {prg for piwi-related gene) to verify their homology to 
Drosophila piwL The prg-1 and prg-2 genes share 40.1% and 38.5% amino 
acid identity to p/vw, respectively, over their entire length. In the C-terminal 104 
amino acid region, the homology increases to 55.8% and 56.7%, respectively. 

20 Moreover, prg-1 and prg- 2are 90% identical to each otherovertheirfull length 
and 98% identical at the C-tennfiinus. This high degree of homology suggests 
fhatprg-l and pfg'2 may represent a gene duplication event The two clones 
differ primarily in that prg-t is 60 amino acids longer at the N-temiinus than prg- 
2. Using ACeDB (Thieny-Mleg and Durt)in, 1992), prg-1 was mapped to 

25 chromosome I between unc-f 5 and gld-1 in cosmid D2030 and prgf-2 was 
mapped to chromosome IV» between unC'44 and smg-Jon cosmid C01G5. 

To isolate human piwi homologs, the human EST (0.9 kb) done was 
sequenced and used to screen a human testis cDNA library. A resulting 2.3 
kb partial cDNA, herein named hiwi (for human p/wi), shows 47.1% identical 

30 amino acid sequence to the Drosophila piwi over its full length, with 58.7% 
identity at the C-temiinus. No p/wnrelated sequences were found firom bacteria 
or yeast genomes whose entire sequences are known. This is consistent with 
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the Stem cell-related function of p/Wand Indicative of p/M^/ family-like genes 
specific for multicellular organisms. 

I n addition to the above p/wi family homologs, a large number of putative 
and known proteins were identified from various animals and plants which 
5 share significant homology with piwi solely at their respective C-termini. This 
indicates that the p/iv/ family is a large novel gene family. Within this family are 
13 additional putative C. elegans proteins and three A. thaliana proteins, two 
of which are required for meristem cell divisions (Moussian, et al., 1998; 
Bohmert, et aL, 1998). Particularly, between piwi and the three Ambidopsis 

10 genes, ZWILLE (ZLL) argonaute (ago), and argonaute-iilce (Moussian et aL, 
1998; Bohmert et aL, 1998), an overall homology of approximately 20% amino 
acid identity was observed. The homology increases to 32% to 52% identity 
in four regions, 30-1 00 amino acid residues each, located throughout the length 
of the PIWI protein, including the highly conserved C-terminal region. Given 

15 the role of ZLL and ago in meristem cell division (Moussian et aL, 1998; 
Bohmert et aL, 1998), this homology may have important implications for a 
conserved stem cell mechanism. 

PIWI, PRG-1 , PRG-2, and HIWI differ from ZLL and AGO proteins, and 
especially from the 13 additional putative C. elegans proteins, predominantly 

20 at the N-temninus, suggesting that this region may be involved in p/w/-specific 
function. The C-terminal conservation suggests that this region of PIWI may 
contain a novel functional domain that plays an important role for the general 
activity of these proteins in diverse biochemical processes, with the N-terminus 
rendering the specificity of the activity. 

25 To examine the C-tenminal region of homology more closely, the 

sequences were aligned using Block Maker*^*^, which reveals characteristic 
regions of protein families (Henikoff et aL, 1995) (Figure 2). Block Maker*^*^ 
analysis identified a 43 amino acid domain conserved among all 22 proteins, 
within which five residues are absolutely conserved with defined spacing. Eight 

30 more residues are also conserved with defined spacing among all known 
genes across the phyla except for several C. eiegans ORFs with unknown 
function. This region is referred to herein as the PIWI box and represents a 
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novel conserved functional nnotif. PiWI,thdtisthep/\v/fanriily of gene products, 
thus represents a novel class of evolutionarily conserved proteins with 
consented functionality, as described herein below. 

DrosoDhila o iwi nnRNA is oresent in both the aemnline and soma during 
5 oogenesis. To Investigate in which cells piwi is expressed to regulate GSC 
division, the expression pattern of p/Mr/ during oogenesis was examined by in 
situ hybridization of whole mount Dmsopiiila ovaries with DIG-labeled DNA 
probes prepared from the p/w/cDNA clone, piwi mRNA is detected specifically 
in the somatic temninal filament cells apical to GSCs in the gennarium and 

10 anterior sheath cells as well as in the gemnline. In the gemiline, it is first 
abundantly expressed in region 2 of the gennarium where 16-cell germline 
cysts are formed, persists at a lower level through stages 1-6 of oogenesis, is 
at its lowest level between stage 7-9, becomes strongly expressed again at 
stage 10, and eventually accumulates in eariy embryos (for staging, see King, 

15 1970). Given that removing sheath cells does not affect oogenesis whereas 
the temninal filament cells play a role in regulating GSC division (Lin and 
Spradling, 1 993), it is likely that the somatic expression in the terminal filament 
is responsible for p/Vw function in regulating GSC division. 

Thus, piwi mRNA expression in Drosopiiila ovaries was characterized. 

20 A complete Ore-R ovariole was studied and showed gemiiine piwi expression 
in region 11 of the germarium (Gell) and in eariy stage egg chambers up to 
stage 10 (S10) was studied, in the germarium, the p/w/RNA was also detected 
in the temiinal filament ceils (TF) and epithelial sheath ceils (ES) apical to 
GSCs. piwi RNA was also observed to be unifonmly present in eariy embryos. 

25 The aforementioned structures were reviewed on slides, and the slides 
included bars denoting 50 ^m, 10 ^m and 50 (tm for ovariole. germarium and 
eariy embryos, respectively, to facilitate review. 

Piwi expression in apical somatic cells regulates GSC division while its 
exoression In the oermline provides a maternal component for embryoaenesis , 

30 To examine the roles of somatic and germline piwi expression, genetic clonal 
analyses on p/w/^ and piwi^ mutations was conducted. First, the genmline 
requirementof p/Vw was tested by germline clonal analysis using the FLP-DFS 
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technique (Chou and Perrimon, 1996). The ovo^^ dominant mutation blocks 
oogenesis at the very beginning stage in a ceil-autononfK)Us manner, so that 
p/wi*^ ov(P^ females contain germ cells that never differentiate beyond the 
germarium. In this ovcP^ baclcground, applicant generated germline cells 
5 homozygous for either the p/vw^ or p/iy/2 mutation by applying a heatshock 
treatment at the second and early third instar larval stage. The resulting adult 
females developed some completely normal ovarioles in which germline cells 
at all stages of oogenesis were observed, and mature eggs were continuously 
being produced (see Table 3). 

10 This result demonstrates that removing the piwi^ovo^^ chromosome 

from the germline allows oogenesis, including GSC division, to occur nomnially. 
Thus, the requirement of p/vw for GSC division does not reside in the gemiline 
but in somatic cells. 

Eggs produced from homozygous p/w/^ or p/w/2 GSC clones are 

15 anrested in embryogenesis, not rescuable by the paternal piwi^ gene (see 
Table 3). Approximately 30% of the arrested embryos show severe mitotic 
defects during cleavage stage. The remaining embryos show various 
morphological defects during late embryogenesis, including a high frequency 
of severe defomnation of the head region. These defects demonstrate that p/Vw 

20 expression in the gemiline provides an essential maternal contribution for 
embryogenesis. 

To determine whether p/iv/ is required in somatic cells outside the ovary 
for GSC maintenance, wildtype gemiaria were transplanted into the abdominal 
cavity of homozygous piwfl and wildtype females, which produced 17±0 egg 

25 diambers (n=1 3) after 7 days of incubation in the host. Parallel transplantation 
into wildtype host yielded 16+2 egg chambers (n=5), indicating that the 
wildtype gemiaria continued oogenesis at a nonmal rate in the piwi^ females. 
Hence, p/iv/ is not required In extra-ovarian cells for GSC maintenance. 

To detemnine in which group of ovarian somatic cells p'm is required for 

30 GSC maintenance, applicant used a similar FRT-mediated technique for 
somatic clonal analysis of p/w/^ and piwi^, marking tiie p/iv/^ chromosome with 
a transgenic myc tag so that homozygous piwt somatic cells are identified by 
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their lack of myc epitope expression (Xu and Rubin, 1 993), By inducing mitotic 
recombination at the second and third instar larval stages, p/ivr somatic clones 
were generated throughout ovarioles, with some egg chambers completely 
covered by piwi mutant follicle cells. These egg chambers develop normally. 
5 Indicating that piw function is not required in follicle cells for egg chamber 
development. Since follicle cells are derived from their precursor cells in region 
II of the germarium (Margolis and Spradling, 1995), this suggests that piwi is 
not required in somatic cells from gemnarial region II on for GSC division and 
ovarian development. These results, together with the pnv/ expression pattern 

10 in situ, suggest that piwi is required in somatic cells in the anterior-most tip of 
the gennarium to regulate GSC division. 

Thus, piwi function is dispensable In the germiine and differentiated 
follicle cells for GSC division. A pair of ovaries containing ovarioles with 
homozygous p/wr germiine clones and ovarioles with p/vw+ ovo^^ germiine 

1 5 stained for DAPI to mark DNA and for VASA to mark germ cells for observation 
via confocal microscopy. The piwi^ ovo^^ germiine cells were anrested at the 
beginning of oogenesis in the germarium due to the ovo^^ mutation. By 
contrast, in ovarioles in the left ovary containing p/ivr germiine clones, GSCs 
divided normally, giving rise to a progression of morphologically wild-type egg 

20 chambers which eventually develop into mature eggs. This indicates that piwi 
is not required in the gemiline for GSC division and subsequent steps of 
oogenesis. To facilitate confocal microscopy, bars denoting 100 |im were 
employed in each slide. 

Confocal microscopy was also employed to show that piwi' clones in 

25 differentiated follicle cells do not effect normal GSC division and egg chamber 
development. Egg chambers were stained with DAPI and a monoclonal anti- 
MYC antibody. Absence of MYC staining was indicative of homozygous piwr 
clones. Induction of somatic clones around an entire eariy stage egg chamber 
had no effect on GSC division and oogenesis. Simllariy, rennoving p/w/ function 

30 from some follicle cells of later stage egg chambers also had no efFect on 
oogenesis. Therefore, p/w/ function is dispensable in differentiated follicle cells 
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for oogenesis including GSC division. To facilitate confocal microscopy, bars 
denoting 50 were employed in each slide. 

DiwhWke genes in C, eleaans are also required for aermline self-renewal . 
The high degree of sequence homology between p/vw and its homologs in other 
5 organisms suggests a potential functional conservation. This hypothesis was 
tested in C. elegans. Quo and Kemphues (1995) have shown that the injection 
of specific antisense RNA into the gemiline syncytium of C. elegans eliminates 
matemal and zygotic gene activity, producing a gene-specific loss-of-Hinctlon 
effect that may persist through several generations. This technique, as refined 

10 by Fire et al. (1 998) and temned RNA-mediated interference (RNAI), was used 
to assess the function ofprg-1 and prg-2. Given the extremely high homology 
between prg-l and prg-2, an anti-prg-f RNA was used for injection to interfere 
with the function of both genes. The F1 progeny of the injected worms were 
designated as prg-RAM/ worms for simplicity. 

15 In wild-type C. elegans, two germline precursor cells, Z2 and Z3, give 

rise to approximately 2000 gemn cells in the adult hennaphrodite. Germline 
proliferation occurs throughout most of larval development (L1-L4) and 
continues in the adult. This proliferation and maintenance of the germline 
requires signals from the DTC at the tip of each gonadal arm. During gonadal 

20 development, DTC migration results in the fomiation of two U-shaped gonadal 
arms by the L4 stage. Gemnline proliferation is limited to the distal end of each 
arm, fonning a mitotic proliferation zone (MPZ) which serves as the GSC 
equivalent in C. elegans. Moving proximally, near the U turn of the gonad, 
germ cells enter meiotic prophase and then further differentiate into gametes 

25 at the proximal half of each ami, producing sperm in L4 and then oocytes in 
young adults. 

The phenotype of pfg-/?AM/ wonms was examined both by quantltating 
their fertility and by assessing their gonadal and germline development via 
DAPI staining and Nomarski optics. As controls for RNA injection, worms were 
30 injected witii BlueScript plasmid RNA sequences. 

The prg-RNAi worms show three major aspects of genmline defects. 
First, as indicated by the distribution of the number of progeny per F1 animal 
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(Fig. 3), the fertility difference between the prg-RNAi and control wonms is 
highly significant (t value=7.93, p<0.0001). On average, prg-RNAi wonms 
produced 92± 53 progeny (n=99), with 75.5% of p/g-/?AM/ wonns giving rise to 
fewer than 125 progeny. By contrast control RNAi animals on average 
5 produced 191 ±40 progeny (n=20), with 90% of these animals giving rise to 
greater than 125 progeny. The significant reduction of the fertility, as well as 
other defects described below, may still only reflect a partial loss of prg-l and 
prg-2 function, since the RNAi technique is known to phenocopy partial rather 
than complete loss-of-function mutants (Kuwabara, 1996). 

10 Second, DAPI staining reveals a dramatic shortening of MPZ in prg- 

RAM; worms. On average, adult p/g-/?AM/wonns exhibited a 50% reduction 
in the number of mitotic nuclei as compared to control animals. Associated 
with the reduction of the mitotic zone Is a gonadal shortening. 57% of prg- 
ffAM/ worms (n=287) exhibited a moderate to severe shortening. In the most 

1 5 severe case, the gonadal arm never made the U-tum. The number of sperm 
produced in these worms is greatly reduced as well. By contrast, only 7% of 
the BlueScript-RNA-injected control animals (n=548) exhibited a mild gonadal 
shortening. The above observed gonadal defects in p/y-RAM/ womris indicate 
that p/g-7 and pATgf-2 are essential for gennline proliferation and maintenance. 

20 Since the gonadal shortening may be due to a defect in DTC migration, it is 
also possible that prg-1 and prg-2 play a role in proper gonadogenesis. 

Thus, reducing prg activity results in reductions in both the MPZ and the 
mitotic index. Variable defects in both the size of the MPZ and the mitotic 
index were observed by DAPI staining and preparation of photomicrographs 

25 of same. The MPZ In each micrograph was denoted in white brackets, in 
moderate to severely defective animals, a 50% reduction in the number of 
mitotic nuclei and a concomitant five-fold reduction in tiie mitotic Index were 
typically obsen/ed. A measurement bar was included in the wildtype 
photomicrograph and denoted 100 ^m for all panels. 

30 In addition to the MPZ shortening, the mitotic Index in the remaining 

mitotic zone is further reduced by 5.5 fold from (7.1+2.2)% (n=1,339 nuclei, 
ns=10 womis) in the control RNAi injected worms to (1.3+0.8)% (n=1,409 
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nuclei, ns=10 worms) in prg-RNAi worms with mildly to moderately shortened 
gonads. This indicates the important role of prg-1 and prg-2 In maintaining the 
mitotic ability of the germilne stem cell nuclei. 

DrosoDhil a oiwi is required for the self-renewing divisions of stem cells 
5 in thQ qerpfillne. The above experimentation demonstrates that piwi provides 
for female and male GSC maintenance predominantly by promoting the self- 
renewing division of GSCs. In a gemiarium, it nomially takes at least 1 0 hours 
for a GSC division to occur and 5 days for a cystoblast to develop into a stage 
one egg chamber (King. 1970; Lin and Spradling, 1993). Necrosis and 

10 apoptosis during oogenesis are also easily detectable. Thus, examination of 
the piwi mutant phenotype at 24-hour intervals immediately prior to the onset 
of oogenesis is sufficient to monitor every major event in germline proliferation, 
differentiation, or cell death. Under these conditions, GSC divisions would 
have been detected as an increased number of germline cysts, whereas cell 

16 death process would have been easily detected by looking for pycnotic nuclei 
and necrotic bodies via DAPI staining and Nomarsici optics (Lin and Spradling, 
1993; Forbes et al., 1996a, b; Lin and Spradling, 1997). Numerous cases of 
GSC divisions and cell death in the germline during Dmsopliila oogenesis have 
been detected using these criteria (Lin and Spradling, 1993; Forbes et al., 

20 1996a, b; Lin and Spradling, 1997). 

Despite their high sensitivity, the techniques followed in this Example, 
even though more systematic, did not detect either GSC division or cell death 
in piwi mutants. Instead, immediately following the onset of oogenesis, only 
differentiating cysts were detected in piwil mutant ovaries, with tiie cyst 

25 number generally connesponding to the number of GSCs. Although this does 
not rule out ttie possibility ttiat Infrequent GSC divisions or cell deatii might 
have been missed, which might account for the smaller-than-^xpected number 
of gametes sometimes obsen/ed in adult piwi mutants, tiiese factors do not 
weaken the conclusion that the predominant defect of piwi mutants is the 

30 differentiation of GSCs witiiout accompanying self-renewing division. 

A numberof elegant studies have identified genes that regulate germline 
proliferation. In mammalian systems, mechanisms mediated by paracrine 



WO0(K32039 PCT/US99/28764 

-94- 

factofs such as the Steel factor (the ligand of the c-KH receptor tyrosine 
kinase), ieul(emia inhibitory factor (LIF), and basic fibroblast grxnwth factor 
(bFGF), control the proliferation and survival of primordial genu cells, i.e., the 
precursors of GSCs (reviewed in Lin, 1997). In C. elegans, the LAG-2/GLP-1 
5 pathway is essential for the self-renewal of a stem-cell equivalent population 
of gemnline nuclei In the gonad (reviewed In Kimble and Simpson, 1997). In 
the Drosophila ovary and testis, the BAM-mediated mechanism is involved in 
the further development of the differentiated GSC daughter (McKearin and 
Ohistein. 1995); the dpp pathway plays a similar role in testis (Matunis et al. 

10 1997). More recently, this pathway has been shown to be essential for GSC 
division and maintenance In the ovary (Xie and Spradling, 1 998). Among these 
discoveries. p/Vw is the first gene known to mediate a somatic signaling 
mechanism to regulate the division and maintenance of GSCs in Drosophila. 
It therefore provides an important entry point for studying the somatic control 

15 of stem cell division in the germline. 

The D/vWaene famil v and the conservation of a stem cell self-renewing 
mechanism . The identification of prg-1 and prg-2 in C. elegans and hiwi in 
human suggest that piwi represents a novel class of evolutlonarily conserved 
genes (described herein as the "p/w/family") with potentially important functions 

20 in GSC self-renewal. Among this class of genes, the significantly higher 
homology between piwi and hiwi as compared to that between piwi and prg- 
1/prg-2 suggests that /i/iW function is ctoser to p/W/. Consistent with this. GSC 
division and gametogenesis In humans are much more similar to that in 
Drosophila than that in C. elegans whose gonads contain syncytial mitotic 

25 genDline nuclei whteh divkle symmetrically and are capable of self-r«newing 
only as a population. The RNAI experiments described herein, which likely 
interiiered with the activity of both prg-1 and pngr-2 due to their extremely high 
homology, caused gennllne depletion similar to that in p/vw" mutants. This 
suggests that the p/wnnedlated mechanism in germline self-renewal Is 

30 conserved even in this evolutionarily distant organism without stereotypic 
GSCs. 
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The conservation of the p/ivA-mediated mechanism appears to extend to 
the plant kingdom as well. A level of homology between PIWI and ZLL and 
AGO was observed. Intriguingly, ZLL is essential for maintaining stem cells of 
the shoot meristem in an undifFerentiated state during the transition from 
5 embryo-specific development to repetitive organ formation through tiie self- 
perpetuating shoot meristem divisions (Moussianetal., 1998). AGO also plays 
an Important role in maintaining nonmal apical shoot meristem function 
(Bohmert et al., 1998). Thus, the homology between piwi and ZLL and ago 
further suggests the existence of a family of novel genes essential for stem cell 

10 division in diverse organisms. 

The conserved piwi mechanism may mediate cell-cell Interactions. In 
Dmsophila, genetic mosaic and p/iv/ expression analyses together suggest that 
p/i/w function is required in the apical non-mitotic somatic cells to control GSC 
division. Similariy, in A, thaliana, it is thought that ZLL is required to maintain 

15 the undifferentiated state of shoot meristem stem cells by relaying positional 
infomiation, possibly by mediating cell-cell interactions within the center of the 
shoot meristem (Moussian et al., 1998), 

Stem cells are characterized by two common properties that extend 
across diverse species: tiie capacity for self-renewal and tiie abilHy to give rise 

20 to numerous progeny that are fated for further differentiation (for review see 
Lin, 1997; Monison etal., 1997). Although significant progress has been made 
in identifying genes important for stem cell function, no common molecular 
mechanism shared by diverse stem cell types in diverse organisms has been 
characterized with respect to those two basic stem cell properties. The piwi 

25 gene family represents the first class of such genes. The analysis of the piwi 
gene family, therefore, provides an important first step towards the elucidation 
of molecular mechanisms underiying stem cell divisions. 

The analyses of this Example were extended to homologs in C. elegans, 
an isolated human homolog {tiiwi), and identified two p/iv/-like genes in 

30 Arabidopsis which are known to maintain meristem cells. This work reveals a 
new class of genes that play an essential role in stem cell division in 
multicellular organisms ranging from invertebrates to humans and plants. 
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Table2 

piwi l rescue results by pRc12 and pRLC^ 
piwiVpiwil fmnVCyO pRc1 2; piwil /piwil pRUD; pml/^iwif 
females 0(136) 100(45) 100(39) 0(81) 

5 males 0(119) 97.8(47) 88.4(43) 0(64) 

The percentage of fertile flies for each genotype and sex are indicated. The 
number of individuals tested for fertility (n) is in parentheses. 
"Data are pooled from 6 and 4 independent insertion lines on the X and 3rd 
chromosomes for pRLD and pRc12, respectively. Not included are two other 
10 pRc12 rescue lines which showed sex-specific restoration of fertility, 
presumably due to position effects stemming from the insertion site. 

Table 3 

Diwi function i n GSC division Is soma-deoendent. while its function in 
embrvoaenesis is aermline dependent* 
15 FLP/FRT oi/oD^-induced germline clones Heatshock N Avg.#eggs 
% embryos laid/female hatched 



{a)piwl1, FRVovo^l, FRT x Oregonf^ males + 


37 


50.4 0 


(b)p/w/2 FRT/ovo^t, FRT x Oregon^ males + 


26 


66.7 0 


(c) p/w/^ FRllov<P1, FRT X OregonF^ males - 


44 


0 na 


(d) p/V/2 FKXlovo^f, FRT x Oregon^i males - 


26 


0 na 


(e) ovoP'f, FRT/CyO x Oregorf^ males + 


20 


0 na 


(f) ovoPt, FRTlCyO x Oregoi^ males - 


10 


0 na 



25 * In all tester and control crosses, a number (N) of newly eclosed females were 
allowed to lay eggs at 25°C for 2 days. In both piwil and p/iv/2 testers, 
gemiline develops beyond that of ovo^^ mutant ovarioles only following heat 
shock, giving rise to numerous eggs. This Indicates the generation of piwr 
germline clones and the dispensability of piwi function in the germline for 

30 nomial GSC division. However, none of the eggs develop beyond 
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embryogenesis, indicating the gemiline piwi expression is required as a 
maternal component for embryogenesis. Tlie ovo^^, FRT controi females lay 
no eggs eithier with or without heatshock. na=not applicable. 

ExPmplQ 2 

5 Molecular Cloning and Exp ression Analysis of Hiwi 

A Human Homoloa of the Drosophila Piwi Gene Essential fnr 

Gemfiline Stem Cell Division and E mbrvoaenesis 
piwi is a novel gene first identified in Drosophila and was demonstrated 
to be essential for germline stem cell development in Example 1. In this 
1 0 Example the cloning of a 3.47 kb human piwi homolog that includes the whole 
open reading frame through cDNA library screening and 5" RACE is described. 
Sequence analysis showed high identity of HI Wl to Drosopliila PIWI, especially 
in the C-terminal domain of the protein. ChronuDsome mapping showed located 
/7/wito Chromosome 12q 24. 33, approximately 130kb from the AFM 295 ye5 
1 5 marker. Northern blot analysis revealed that tiiwi is only expressed in the testis 
among human adult tissues. The sequence and expression homology between 
human and Drosoptiila piwi genes suggested that piwi is conserved during 
evolution and plays an important role during eariy stages of spennatogenesis 
in human. 

20 Human spermatogenesis followed a precise and coordinated manner in 

the seminiferous epithelium of testis (reviewed in Bellve, 1979; Thomas et aL, 
1989; Hoog, 1995). During this period the gemi cells undergo a series of stage- 
specific morphological transitions, beginning with the proliferation and renewal 
of spemiatogonia, followed by meiosis of spemiatocytes, and finished by 

25 differentiation of spennatids during spenniogenesis (Oakberg, 1956a; Nebel et 
al., 1961; Bellve et aL. 1977; Russel et al., 1990). Closing to the basal 
membrane, type A spemiatogonia are stem cells (Bartmanska and Clenmont. 
1983; De Roolj, 1988), undergoing a series of mitoses to give rise to more type 
A cells or produce type B spemiatogonia which passes through a mitotic 

30 division to differentiate into spermatocytes. 
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Spermatocytes need to undergo a final round of DNA replication before 
entering meiosis. During meiosis, the synaptonemal complex and the 
recombination nodule become associated with chromosomes, which was 
thought to be important for two basic taslcs in meiosis, the reduction of ploidy 
5 level and generation of new combinations of genes (reviewed in von Wetlstein 
et aL, 1984). The most radical morphological changes occured during 
spermiogenesis (Oalcberg, 1 956b). Based on acrosome fonnation and nuclear 
condensation, the round spemiid differentiated into mature spennatozoa which 
released into the lumen indicating the end of the spermatogenesis (rBviewed 

10 in Bellve and O'Brien, 1983). 

The proliferation and differentiation of germ cells in the seminiferous 
tubule has been demonstrated to be associated with adjacent somatic ceils, the 
Sertoli cells (reviewed in Russel and Griswold, 1993; Griswold, 1995; Bitgood 
et aL, 1996; de Miguel et al., 1997). The tubule of human testis is composed 

15 of germ cells in all stages of spermatogenesis. From basal membrane to lumen 
are cells ranging from stem cell to differentiated cell state and from proliferation 
to differentiation stage. Particular cell at particular developmental stage is 
associated with others at given sites in the tubule, the least mature stem cells 
are located furthest outside, while the most mature spennatozoa are located 

20 inside the seminiferous epithelium. The specific tubule organization and cellular 
association thus provide a good model system to study germ cell development. 

Genes expressed in male germ cells had been identified in high 
mammals (Kramer and Erickson, 1982; Gold et al., 1983; Thomas et al., 1989; 
Wolgemuth and Watrin, 1991; Hoog, 1995). Some of these genes were 

25 expressed exclusively during spemnatogenesis (reviewed in Willison and 
Ashworth, 1987; Erickson, 1990; Sandlowetal., 1997; van Roijen et al., 1998), 
while others were expressed only in the later stages of spemiatogenesis 
(Shankar et al., 1998; Zhu and Naz, 1998). IHower, very few genes had been 
demonstrated that was expressed specifically in the eariy stage of 

30 spermatogenesis, in the stem cells (Menke et al., 1997). 
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To study the mechanisms of p/Vw family function in germline stem cells 
In various organisms during evolution and especially its role in human beings, 
molecular cloning and expression analysis of human p/w/ family homolog was 
performed. 

5 5>RACE . A 2.3kb partial human piwi family cONA clone was first 

isolated from adult human testis cDNA library through library screening. To 
obtain the 5' end of this gene, a cDNA synthesis primer and a gene specific 
primer has been designed. Adult human testis Poly A+ RNA was purchased 
from Clontech, Palo Alto, California. First-strand cDNA was obtained by using 

10 the above cDNA synthesis primer and AI\AV reverse transcriptase (Clontech, 
Palo Alto, California). Second-strand synthesis cDNA synthesis was performed 
with second-strand enzymes bought fix>m Clontech, Palo Alto, California 
subsequently. Using the above gene specific primer and an adaptor primer 
obtained from Clontech, Palo Alto, California, an 1.4kb fragment has been 

1 5 obtained through 5'-RACE PCR by using MarathonTM cDNA Amplification Kit 
(Clontech, Palo Alto, Califomia) and has been cloned into a pGEM-T Easy 
vector (Promega, Madison, Wisconsin) subsequently. 

Construction of a full length human piwi homoloQ and its sequence 
analvsis . The fully-length hiwi cDNA was generated in a modified pGEM-T 

20 Easy vector (Promega, Madison, Wisconsin) by partial digestion of above two 
overiapping cDNA fragments with Sad and subsequent ligation with T4 DNA 
ligase (Gibco). The full length human p/w/ family homolog was sequenced fully 
on both strands using an ABI 377 Prism^*^ DNA Sequencer (Perkin-Elmer, 
Applied Biosystems) and analyzed using the DNAStar^*^ sequence analysis 

25 package (DNASTAR, Madison, Wisconsin). Homologous comparison between 
hiwi and its homologs was canied out using the Megalign^*^ program from the 
DNAStar™ package. 

Chromosome mappino . Radiation Hybrid mapping against Stanford G3 
Human Hamster panel was used for chromosome mapping for hiwi. 

30 Northern Blot Analysis . A 0.9kb EST fragment used for Northern blot 

analysis was purchased from ATCC (American Type Culture Collection) and 
was subcloned into a pBlueScript KS(+) vector (Stratagene) with EcoRI and 
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Hindlll. The 3.47kb plasmid DNA obtained from 5'RACE was used to make 
sense and antisense RNA probes for in situ hybridization. The above plasmid 
DNA were linearized with appropriate restriction enzymes. Using these 
linearized template DNA, sense and antisense RNA probes for Northem blot 
5 analysis and in situ hybridization were transcribed from the T7 and SP6 
promoters with appropriate T7 and SP6 RNA polymerase (Gibco), following the 
protocols suggested by manufacturers. Probes used for Northem blot analysis 
were labeled by ^P-UTP (Amersham). 

Premade Northem blot of high-quality poly A+ RNAfrom different human 

1 0 tissues (Human Multiple Tissue Nothem Blot II) was purchased from Clontech, 
Palo Alto, California. Human O-actin cDNA probe (Clontech, Palo Alto, 
California) was used as a positive control. ExpressHyb^*^ Hybridization Solution 
(Clontech, Palo Alto, Califomia) was used as hybridization buffer for Northem 
analysis. The hybridization for both sample blot and positive control was 

1 5 performed under 68*C for 2 hours followed by 2 washes in 2x SSC, 0.05% SDS 
for 50 minutes at room temperature, and another 2 washes in 0.1x SSC, 0.1% 
SDS for 40 minutes at 50*C. The sample blot was exposed at -80*C for 10 
hours while the positive control was exposed for 2 hours at the same 
temperature. 

20 Cloning of fulNenath human d/w/ cDNA clone . An isolated 2.3 kb partial 

human piwi cDNA clone from a human testis library by library screening is 
described in Example 1. To obtain the 5' end of this gene, by using a gene- 
specific primer, a 1 .4 kb cDN A fragment containing overlapping sequence was 
obtained thn^ugh 5*RACE and subcloned. To obtain a full length human piwi 

25 fomily done, the above overiapping cDNA clones were partially digested. 
Fragments of the conect sizes were subsequently ligated and subcloned into 
a pGEM-T Easy vector (Figure 4) to generate the 3472 bp full-length human 
pnv/famiiy (hiwi) that includes an open reading frame (ORF) encoding a protein 
of 861 amino acids, with a predicted molecular weight of 94.71 kDa (Fig. 7 and 

30 Fig. 8). The ORF starts at 167bp and stops at 2749bp. Upstream of the start 
codon is a 1 66bp 5'untranslated region (5'UTR). Downstream of the stop codon 
is a 723bp 3*untranslated region (3'UTR). 
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Proteln sequence analysis showed that HIWI is 36.6% identical to 
Drosophila PIWI at amino acid level over its full length and 58.7% identical at 
the C terminal 1 04 amino acid residues. HIWI also shares 33.4% and 33.1 % 
identity with two C. elegans PIWI family homologs PRG-1 and PRG-2 over Its 
5 full length while the percentage increased to 57.7% and 58.7% at the C- 
temiinals, suggesting p/vw family genes and gene products are conserved 
during evolution. In addition, the higher degree of sequence homology between 
the C temiinals suggests a conserved role of the C-tenminal domain of PIWI in 
protein function. 

10 Chromosomal maopina of hiwL By using Radiation Hybrid mapping 

method against the Stanford G3 panel, human piwi was mapped to 
chromosome 1 2q 24. 33, approximately 1 30kb from the AFM 295 ye5 marker. 

Tissue distribution of hiwi in adult human . Northern blot analysis were 
performed to study the tissue distribution of hiwi in adult human tissues. A 

15 variety of human tissues were examined, including spleen, thymus, prostate, 
testis, ovary, small intestine, colon and peripheral blood leukocyte. Among 
them, two transcripts were detected in testis: a more abundant 3.6kb band and 
a less abundant band of 8.1 kb. No expression was found in other analyzed 
tissues, consistent with the known functions of hiwi in GSC division, 

20 An autoradiograph of Northern blot analysis of hiwi mRNA in adult 

human and mouse tissues was thus prepared. The expression of hiwi mRNA 
In different tissues: spleen; thymus; prostate; testis; ovary; small intestine; 
colon; peripheral blood leukocyte. Two A/m^' transcripts of 3.6kb and B.lkb are 
visible in testis. Human &-Qctin cDNA probe was used as a positive control for 

25 the Northern blot with above tissues. 

The specific expressbn of hiwi 'm the testis is contemplated to modulate 
the mechanism of some human genetic diseases related to testis malignancy 
and/or infertility, hiwi and other mammalian p/ivr family genes play an important 
role in germline stem cell division and other stem cell-related processes and 

30 are thus contemplated to be useful in the therapeutic methods, the screening 
methods, and the diagnostic and prognostic methods of the present invention 
set forth herein. 
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Example 3 

Molecular Cloning and Expression Analysis of miwi. 
a Murine Homoloo of the Drosophila oiwi Gene 
SEQ ID N0s:3 and 4 depict the cDNA and protein sequences of m/iv/. 
5 the murine homolog of the p/w/ gene in the p/vw family. These sequences were 
purified and isolated as described in Examples 1 and 2 using a murine testis 
cDNA library and murine testis mRNA as starting materials. The genomic locus 
ofmiwiis depicted schematically in Figure 5. Additionally, homology alignment 
between the PIWI. HIWI, MIWI. PRG-1 and PRG-2 proteins is anaylzed in 
10 Figures. 

RNA in situ hybridization of a miwi probe to sections of wildtype mouse 
testis was also perfomied. miwi is expressed specifically in the gemriiine stem 
cell region in the mouse testis at specific stages of the seminiferous tubule 
cycle. Color photographs of RNA in situ hybridization of a miwi probe to 

1 5 sectioned wildtype mouse testis were prepared. A measurement bar denoting 
40^m was employed in each photograph. 

The murine p/w/ family gene and gene product (miwi) thus also plays a 
role in germline stem cell renewal and in other developmental processes based 
on the following important data and reasoning. First, miwi is more homologous 

20 to piwi than prg-l, prg-ll, Zwille and ago to p/w/ (see Example 1 and Figure 6). 
Next, murine germline stem cells are much more stmilarto Drosophilia germline 
cells than to the C. elegans genniine or Arabidopsis meristem cells. Further, 
the function of piwi family genes are well conserved among Drosophilia, C. 
elegans, and Arabidopsis, which are evolutionarily more diverse than between 

25 Drosophilia and human/mammals, miwi is further shown to be expressed 
specifically in the genniine stem cell region in the mouse testis. Finally, the 
specific expression pattem of miwi in the adult is in the murine testis, but not 
in the adult murine ovary which do not contain gemniine stem cells. 

Therefore, miwi, as well as hiwiand other mammalian p/w/family genes, 

30 play an essential role in germline stem cell division and other stem cell-related 
processes and are thus contemplated to be useful in the therapeutic methods. 
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the screening methods, and the diagnostic and prognostic methods of the 
present invention set forth herein. 

Example 4 

PiWI Family Protei ns are Nuclear Proteins Whose Level 
5 Determines the Number of Stem Cells In the Gemiline 

Two immunofluorescence micrographs were prepared and showed 
germline cells (marked In green) in the Drosophila gemiarium in the 
background of nomial and elevated level of PIWI . The ectopically elevated 
expression of pnv/ causes an increase in the number of stem cells and/or their 

1 0 immediate daughter cells (cystoblasts) from nomnally 2-3 to approximately 1 5, 
with 7 of which were shown in the focal plan of micrograph of elevated PIWI 
levels. Stem cells were marked by the spectrosome (Sp), an organelle specific 
to these cells. The differentiating germline cyst contains fusorne (F), an 
organelle specific to mitotic stage of cyst development. Thus, over-expression 

15 of plwi increases the number of germline stem cells/cystoblasts. 

Immunofluorescence micrographs of detection of a myc-tagged PIWI 
protein (with and without myc as a control) in Drosophila ovariole, Drosophila 
tenninal filament, basal cells and GSC in Drosophila gemnarium, apical polar 
cells (APC) in Drosophila egg chambers, and hub cells in Drosophila testis 

20 were also prepared. These mk:rographs showed that PIWI is a nuclear protein 
present in both the gemriline and in somatic cells at different stages of 
oogenesis and spennatogenesis. myo-PIWI was shown to completely rescue 
the defects caused by loss of p/w/ function, thus demonstrating a therapeutic 
method of the present inventton. PIWI is demonstrated to be a nuclear protein 

25 at certain development stages. MIWi and HIWI proteins are thus also 
contemplated to be nuclear proteins at certain development stages. 

Additionally, as disclosed below, the PIWI protein can localize either in 
the nucleoplasm or cytoplasm, the dose of the PIWI protein detemriines both 
the number and the mitotic rate of germline stem cells, and the PIWI protein is 

30 localized in the gennplasm (the cytoplasm of future gemi cells) and plays a key 
role in determining the initial number of germ cells in during embryogenesis. 
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The mouse PIWI protein (i.e., MiWI protein) has also been observed in the 
cytoplasm via antibody screening and is thus also envisioned to be a 
cytoplasmic factor with very similar functions in gerni cell and stem cell 
detennination. Thus, the HIWI protein is thus also envisioned to be a 
5 cytoplasmic protein at certain development stages. 

Example 5 

Dhm Encod es a Nudeooiasmic Factor Whose Activity Modulates the 
Number and Division Rate of Germline Stem Cells 
As disclosed above, piwi represents the first class of genes known to 
be required for stem cell self-renewal in diverse organisms. In the Drosophila 
ovary, piwi is required in somatic signaling cells to maintain germline stem 
cells. In this Example it is shown that that p/Vw encodes a novel nucleoplasmic 
protein present in both somatic and germline cells, with the highly conserved 
C-temiinal region essential for its function. Removing PIWI protein from single 
germline stem cells significantly decreases the rate of their division. This is 
indicative of a second role for PIWI as a cell-autonomous pronwterof gennline 
stem cell division. Consistent with its dual function, over-expression of piwi in 
somatic cells causes an increase both in the number of germline stem cells and 
the rate of their division. Thus, PIWI is a key regulator of stem cell division-its 
somatic expression modulates the number of gemnllne stem cells and the rate 
of their division while ite germline expression also contributes to promote stem 
cell diviston in a cell-autonomous manner. 

Materials and Methods 
DmsoDhila strains and cultura. All Drosophila strains were grown at 
25*0 on yeast-conteining molasses/agar medium. The following fly strains 
were used in this study: pnw^ and piw^ mutent chromosomes, generated in 
different genetic screens (Lin and Spradling, 1997), were dominantly marked 
with Irregular fiacets (If) (Lindsley and ZImm. 1992); ep(2)1024 is an EP 
element insertion (Rorth, 1996; Rorth et al., 1998) in the piwi 5' UTR; 
30 p/vw^FRT40A/CyO and p/Vw2FRT40A/C>O; hsFlp; + FRT40A/CyO (Xu and 
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Rubin, 1993); hsFlp; amadilMacZ FRT40A {armlacZ FRT40A. Lecuit and 
Cohen, 1997); hsGal4 (Brand and Pemmon, 1993) are as described herein 
above. 

Generation of mutant oemiline stem cell clones, pwi mutant GSC 
5 clones were generated by Flp-mediated recombination as described by Xu and 
Rubin (1993). To generate wild-type and mutant GSC clones for analyses, 
piwi^ FRT40A/CyO, p/iv/^FRT40A/CyO and p/m2FRT40A/CyO males were 
mated to w hsFlpl ; armlacZ FRT40A virgin females, respectively, to produce 
IV hsFlpl/^; piwi* FRT40A/ armlacZ FRT40A progeny, where * denotes the 

1 0 wildtype or mutant alleles. The parental adults were transferred to fresh vials 
after 2 days. Larvae from the original vials were heat-shocked once daily for 
1 hour in a 37'C water bath on days 3 and 4 to induce mitotic recombination 
immediately prior to the onset of oogenesis at the late third instar stage as 
described herein above. After eclosion, adult w hsFlp1/+; piwS* 

15 FRT40A/am7/acZ FRT40A females were transferred to fresh food at room 
temperature and ovaries were removed 1 week, 2 weeks, and 3 weeks after 
the last heat shock treatment and processed for anti-lacZ antibody staining to 
look for lacZ-negative clones. 

Heat-shock induced PIWI overexpression. To construct a piwh 

20 overexpressing stock, ep(2)1024/CyO virgin females were crossed to males 
homozygous for a hsGal4 transgene on chromosome 3. Females canying both 
the ep(2)1024 chromosome and the hsGal4 transgene were heat shocked at 
37*C for 1 hour each time with an Interval of 1 2 hours for 3-4 days. At the end 
of the heat shock regime, ovaries were dissected and processed for antibody 

25 staining. 

To analyze tiie functionality of the PIWI protein produced by teGa/4- 
Induced overexpression of ep(2)1024, ep(2)1024/CyO; hsGal4/hsGal4 virgin 
females were crossed to pm^lf/CyO; +/+ males to produce the 
ep(2; f /?5Ga/4/+ transheterozygous progeny. After2days, parental 

30 adults were transferred to fresh vials. Larvae were heat shocked daily for 1 
hour at 37*C starting at day 3 and continued to 3 days after eclosion. The 
transheterozygous progeny were subsequentiy dissected and processed for 
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antibody staining. The transheterozygous progeny which were not subjected 
to the heat shoci; regime served as controls. 

Immunofluo rescence microscoov and BrdU labeiing. Ovaries and testes 
were dissected, fixed and steined as described by Lin et al. (1994). For 
5 immunofluorescence staining the tbilowing antisera were used: polyclonal anti- 
Vasa antibody (1 :2000; Hay et al., (1 990)), monoclonal anti-Hts antibody 1 B1 
(1:1; Zaccai and Lipshitz, (1996)), polyclonal anti-a-spectrin antibody (1:200; 
Byer et al.. (1987)), monoclonal anti-myc epitope antibody 1-9E10.2 (1:50; 
Evan et al., 1985), monoclonal anti-BamC antibody (1:1000; McKearin and 

1 0 Ohistein, 1 995), polyclonal anti-li-galactosldase antibody (1 :600; Cappel, Costa 
Mesa, Califomia), monoclonal anti-BrdU antibody (1:50; Becton-Dickinson, 
Franklin Lakes, New Jersey). All the fluorescence-conjugated secondary 
antibodies were from Jackson Immunoresearch Laboratory (Westgrove, 
Pennsylvania) and were used at 1:200 dilution. 

1 5 Immuno-fluorescently labeled samples were also counter-stained with 

DAPI as described previously (Lin and Spradling, 1993). Micrographs were 
taken using either a Zeiss AXIOPLAN® microscope or a Zeiss LSM410™ 
confbcal microscope (Zeiss, Oberkochen, Gennany) as described herein 
above. BrdU labeling was perfonned essentially as described in Gonczy and 

20 DiNardo (1996). Briefly, on day 3 of the heat shock regime, ep(2)1024; 
hsGAL4 females were transferred to an Eppendorf tube that was held 
horia>ntelly and contained 100 yil of 100 mM BrdU in grape juice. The flies 
were fed (pulse) with BrdU for 1 hour at room temperature. After the pulse, 
flies were retumed to fresh vials without BrdU for 1 2 hours at room temperature 

25 (chase). These females were subsequently dissected and fixed for 
immunofluorescence labeling with anti-BrdU and anti-a-spectrin antibodies. 

Constmctton of tran saenes encoding N- and C-terminal-taaaed PIWI 
proteins. Synthetic oligonudeotkles (Gibco BRL) encoding a myc-epitope were 
cloned into unique sites at the N- and C-temninus of the p/m ORF in a 6.8 kb 

30 piwi genomic constmct (pRcl 2) that fully rescues the piwi activity (Examples 
1 -4 above and Cox et al., 1 998) as follows: For the N-terminal myc insertion, 
the following primers were used: Bcl/myc: 5'-GAT CAT ATG GAG CAA AAG 
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CTT ATT AGC GAG GAA GAT CTG AAT-3' (SEQ ID N0:7) and Bcl/antisense 
myc: 5'-GAT CAT TCA GAT CTT CCT CGC TAA TAA GCT TTT GOT CCA 
TAT-3' (SEQ ID NO:8). The primers were annealed accoixling to the 
nnanufacturer's recommendations and cloned into a unique Bell recognition site 
5 in a Bluescript done containing piw genomic DNA. 

The resulting recombinant myc-p'wi gene was cloned into pCasper 4 
(Pinnotta, 1988) to produce the plasmid pPMB1-6 in which the myc sequence 
is Inserted between PIWI amino acid residues 3 and 4. For the C-tenninal myc 
insertion, the following primers were used: Nhe/myc: 5'-CTA GCA TAT GAG 

10 CAA AAG CTT ATT AGC GAG GAA GAT CTG AAT AAG-3* (SEQ ID N0:9) 
and Nhe/antisense myc: 5'-CTA GCT TAT TCA GAT CTT CCT CGC TAA TAA 
GCT TTT GCT CAT ATG-3' (SEQ ID NO:10). Primers were annealed as 
described above and cloned into a unique Nhel recognition site in a Bluescript 
clone containing part of p/vw genomic DNA. The recombinant insert was cloned 

1 5 into pCasper 4 (Pinotta, 1 988) to produce the plasmid pPMN in which myc is 
Inserted between PIWI amino acid residues 795 and 796. The insertions were 
confimned by DNA sequendng. 

P-element mediated aemfiilne transformation. Transgenic flies were 
produced according to Spradling and Rubin, 1982, using w; 2-3 Sb em/16 e 

20 (Roberteon et al., 1988) embryos as recipients. Eight independent pPI\/IB1-6 
lines and twelve independent pPMN lines were recovered. The myc-tagged 
transgenes were then separated ftom the 2-3 transposase and introduced into 
ttie homozygous p/nw^ background torfertility tests by genetic crosses. Rescue 
crosses were carried out at 25'C. Each transgene was further tested for Myc 

25 expression by whole-mount immunofluorescence. 

Immunoblottlna analysis and in vibo translation To prepare total protein 
extracts from pPMBI -6 transgenicflies, ovaries were dissected in 1 X EBR (1 30 
mM NaCI; 5 mM KCI; 2 mM CaClj; 10 mM Hepes pH 6.9) and homogenized in 
IX sample buffer (5% 2-mercaptoethanol, 3% SDS, 10% glycerol. 62,5 mM 

30 TrisCI pH 6.8, 0.1% bromophenol blue). Protein samples were boiled 5 
minutes at 95*C and then loaded on 10% SDS-PAGE gels. After 
electrophoresis, gels were electro-transfenred to Genescreen (NEN Research 
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Products, DuPont, Wilmington. Delaware) In a TRANS-BLOT® apparatus (Blo- 
Rad Laboratories, inc.. Hercules. Califoniia). Blots were blocked for 1 hour in 
Blotto (5% non-fat dry milk in IX PBS) and primary antibodies were 
subsequentiy diluted in Blotto followed by an incubation overnight at 4*C. The 
5 nranoclonal 9E1 0 anti-myc antibody (Evan et al.. (1 985)) was used at a dilution 
of 1:50. HRP-conjugated anti-mouse IgG secondary antibody (Jackson 
Immunoresearch Laboratory, Westgrove, Pennsylvania) was used at 1:4000. 
Blots were washed and processed for detection using enhanced 
chemilumlnescent (ECL) reagents as recommended by the manufacturer 

10 (Amersham, Buckinghamshire, England). In vHro translation analysis was 
performed on the plasmid pDC2 to produce ^S-labeled PIWI protein according 
to the manufacturer's recommendations (Promega, Madison. Wisconsin). The 
translation products were separated on a 10% SDS-PAGE gel and images 
were collected by exposing the gel to X-ray film. 

1 5 Quantitation of oennline stem cell division rate. Relative division rate of 

piwt GSCs was determined as follows: The number of marked piwr cysts in 
30 germaria was counted to obtain the average number of the piwr cysts per 
gennarium. This number is then divided by the average number of marked 
wild-type cysts per germarium as obtained from counting 20 gemiaria. The 

20 relative division rate of GSCs in the p/wZ-overexpressing germaria and that in 
the wild-type and un-induced ep(2)1024 control gennaria was determined by 
counting the flrequency of telophase spectrosomes contacting the cap cells in 
the 2-3 GSCs in the most apical region of the germarium. 

Results 

25 PIWI is a nuclear protein present in both the soma and oenmline in both 

sexes. Examples 1-4 atK>ve demonstrate that p/vw encodes a 97.2 kDa novel 
protein required for the self-renewing division of GSCs in both males and 
females. To elucidate further the function of p/m in regulating GSC division, 
the expression and subcellular behavior of the PIWI protein was studied in v/Vo. 

30 The PIWI protein was tagged by inserting a sequence encoding a myc epitope 
(Evan et al., (1 985)) into the piwi gene, at the 5* end of the piwi open reading 
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fifame (Figure 2). The p[5 -myc-piw/| transgene (denoted as myc-piwi) was 
introduced into Drosophlla via P-element mediated gemnline transfonriation 
(Spradling and Rubin. (1982)). myc-piwi fully restores the fertility and 
gametogenesis oiphNi^ mutant males and females (Table 4). Thus, the myc- 
5 PIWI protein confers wild-type PIWI function. In contrast, a p|p/WF3 -myc] 
transgene (denoted as piwi-myc) with the myc sequence inserted in the highly 
conserved C-terminal region (Fig. 7) fails to rescue any p/W/ mutant phenotype 
(Table 4). The PI Wl-myc protein was not detectable either by Western blotting 
analysis or by immunostaining of the transgenic ovaries (Table 4), indicating 
10 that this region is essential at least for the stability of the PIWI protein. 
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The expression of myc-PIWI was confirmed by immunoblot analysis 
using a monoclonal anti-myc antibody as a probe, wliich reveals a single 97*5 
kDa band in tlie ovarian extract of myc-piwifl\es. This agrees well with both the 
97.2 kDa predicted size of PIWI and the 97.1 kDa observed size of PIWI vifhen 
5 a ful Wength piwi cDNA was translated in vitro. Furthermore, eight Independent 
myC'piwi transgenic lines share the identical subcellular localization pattern 
throughout gametogenesis. Thus, the myc-PIWI protein reflects the function 
and distribution of endogenous PIWI. 

The expression and localization pattern of myc-PIWI during 

10 gametogenesis was also examined by immunofluorescence microscopy. 
Whole mount ovaries and testes isolated firom wild-type flies canying a myc- 
p/w/transgene were stained with the anti-myc antibody to visualize myc-PIWI; 
this antibody did not detect any signal above the background in ovaries and 
testes from siblings without myc-piwi. The samples were also stained with anti- 

15 VASA antibody to highlight germ cells and DAPI to label nuclear DNA. In the 
third instar larval ovary, where GSCs begin their asymmetric divisions, myc- 
PIWI was found In the nucleus of all genu cells of the ovary. In addition, myc- 
PtWI was also present in the nuclei of the forming temiinal filament cells. In 
adult ovaries, myc-PIWI is present in the nuclei of both the somatic and 

20 gemnline cells, as predicted by piwi RNA in situ analyses described in 
Examples 1-4. Specifically, in the genmarium, myc-PIWI is expressed in all the 
somatic cells, including the main terminal filament cells, the cap cells and the 
inner sheath cells, consistent with the genetic clonal analyses presented herein 
above which indicated that piwI is required in these cells to maintain GSCs. 

25 PIWI is also expressed in somatic stem cells and the follicle cells in the 
germarium. 

In the gemnline, myc-PIWI shows a dynamic nuclear expression pattern: 
it is present abundantly in GSCs, but is down-regulated In cystoblasts and 
developing cysts. In 16-cell cysts in regions 2 and 3, the myc-PIWI regains its 
30 high level expression, and remains so in nurse cells and oocytes in post- 
gennarial egg chambers throughout oogenesis. This expression is consistent 
with the clonal and RNA in situ analyses presented herein above which Indicate 
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that PiWI is also loaded in eggs as a maternal component essential for 
embryogenesis. In all post-gennarial egg chambers, myc-PIWI is also 
specifically expressed in the anterior polar follicle cells. The consistent nuclear 
localization of myc-PIWI in various types of somatic and gemnline cells during 
5 oogenesis indicates that PIWI is a nuclear protein. 

Therefore, PIWI is a nuclear protein present in the soma and germline 
of both sexes. In the third instar larval ovaries, PIWI is present in the nucleus 
of all germline cells (GSC) as well as in forming tenninal filament cells (TFC) 
as revealed by myc-PIWI in red and by VASA. a germline-specific marker, in 

10 green. PIWI is expressed in a dynamic pattern in the adult gemnarium (Ge), 
while being weakly expressed in main terminal filament cells and strongly 
expressed in cap ceils (CpC), In the soma, PIWI is strongly expressed in the 
cap cells, inner sheath cells (ISC), somatic stem cells (SSC), and follicle cells 
(FC). In the germline, PIWI is expressed in gemnline stem cells (GSC) and is 

1 5 downregulated in cystoblasts (Cb) and eariy mitotic cysts. PIWI regains its high 
level expression in differentiating 16-cell cysts (DC) in regions 2b and 3. 

In post-germarial egg chambers, PIWI is expressed strongly in all nurse 
cell nuclei (NCN) as well as in the nucleus (genminal vesicle, GV) of the 
developing oocyte. In the somatic follicle cell layer around each egg chamber, 

20 PIWI is expressed in anterior polar follicle cells (APC). The sharp down 
regulation of PIWI in cystoblast is also evident in the germarium, in which the 
three PlWI-bright small nuclei at the germarial apex are cap cell nuclei. The 
two larger nuclei in the next layer are GSC nuclei. Other small PlWI-bright 
nuclei in region 1 are inner sheath nudei. The PlWI-negative region starts with 

25 a cystoblast positioned in the third layer. 

In the third instar larval testis, PIWI, is present in the nuclei of both 
somatic and germline cells. In the soma, PIWI is expressed in the nuclei of hub 
cells (Hub) apical to GSCs and in cyst progenitor cells (CPC). In the genmline, 
PIWI is present in the nuclei of GSCs and their immediate daughters. The 

30 apical region of the adult testis shows the same apical-distal organization as 
the third instar larval testis. PIWI expression pattem in the adult testis is 
identical to that in the third instar larval testis. PIWI is expressed in hub cells, 
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GSC, and CPC. Ovaries from sibling females which do not cany the myc-piwi 
transgene show no specific staining with the anti-Myc antibody. 

PIWI is also expressed as a nuclear protein durino sperniatooenesis. 
I n the third instar larval testes which contain mostly premeiotic genu cells, myc- 
5 PIWI is localized to the nuclei of apical somatic cells, including the hub cells, 
which are the testicular equivalent of terminal filament cells. myc-PIWI is also 
present in somatic stem cells and their progeny, the cyst progenitor. At the 
apical germline, PIWI is present in the nuclei of GSCs and their immediate 
daughter cells. In newly formed 16-cell cysts of primary spermatocytes 

1 0 somewhat away from the apex, the myc-PIWI staining is sharply reduced in the 
germline. The staining is only present in the cyst progenitor cells, which are 
equivalent to follicle cells in the ovary. Once the developing cyst enters the 
spermatocyte growth phase, myc-PIWI expression is completely undetectable. 
This pattem of expression is maintained in the adult testis which displays the 

15 same apical-distal oi^anization but now contains more differentiated post- 
meiotic germ cells in the basal region of the testis. Given the role of piwi in 
testicular gemiline stem cell maintenance described herein above, the nuclear 
localization of PIWI in the testis is envisioned to be of functional importance. 
PIWI is localized in the nucleoplasm. To further investigate the role of 

20 PIWI in the nucleus, the subnuclear localization of the myc-PIWI protein was 
examined. In interphase GSCs, myc-PIWI is present in subnuclear areas 
where DNA staining is the weakest. The myc-PIWI staining appears as 
numerous brightfoci in DNA-deficient areas that are sometimes connected into 
a network, complementing the DNA staining pattem. The DNA-bright area is 

25 completely devoid of myc-PIWI. In interphase cap cells, myc-PIWI shows a 
similar staining pattem. This staining pattern suggests that, in interphase 
somatic and gennline nuclei, myc-PIWI is not associated with the chromatin or 
the nuclear envelope, but is localized in the nucleoplasm. 

The nucleoplasmic localization of myc-PIWI is even more evident during 

30 GSC division. At metaphase, myc-PIWI is not associated with mitotic 
chromosomes or the nuciearenvelope area, butbecomes scattered throughout 
the cytoplasm, fomning concentrated foci at the cell cortex. The localization 
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pattem remains so throughout anaphase. This dynamic localization pattern of 
myc-PIWI verifies that PIWI is not associated with chromosomes orthe nuclear 
envelope, but is a nucleopiasmic factor that is dispersed into the cytoplasm 
during mitosis. 

5 The myc-PIWI protein also shows clear nucleopiasmic localization in the 

nuclei of nurse cells, oocytes, and follicle cells in developing egg chambers. 
In nurse cell nuclei, myc-PIWI is accumulated in subnuclear areas devoid of 
DNA staining, complementing the DNA localization pattem. This 
complementary relationship is most clearly illustrated in the oocyte nucleus 

10 (genninal vesicle), in which chromatin forms a discrete subnuclear structure 
called the karyosome that occupies a spherical area in the nucleus (King, 
1970). The myc-PIWI staining is not detectable in the karyosome, but is 
conspicuously present in the nucleoplasm that surrounds the karyosome. In 
follicle cells, myc-PIWI is also localized in the nucleoplasm in a punctate 

15 fashion. Thus, PIWI is a nucleopiasmic factor in all types of ovarian cells. 

Therefore, PIWI is localized in the nucleoplasm. In the interphase 
germline stem cell nucleus, myc-PIWI mostly accumulates as bright foci in 
DNA-deficlent areas within the nucleus. DNA-bright spots were observed to 
be are completely devoid of myc-PIWI. This complementary staining pattem 

20 is also observed in cap cells (CpC). 

in metaphase and anaphase germline stem cells, the myc-PIWI staining 
is dispersed throughout the cytoplasm, forming numerous foci at the cortical 
region of the cell. In a stage 4 egg chamber (for staging, see King, 1 970), myc- 
PIWI is accumulated in regions of nurse cell nuclei (NCN) and follide cell nuclei 

25 (FCN) devoid of DNA staining. In the germinal vesicle (GV), the myc-PIWI 
staining is absent from the karyosonne (K) but present in the nucleoplasm that 
surrounds it. The oocyte region of a stage 6 egg chamber showing the 
complementary myc-PIWI and DNA staining in the germinal vesicle (GV) and 
follicle cell nuclei (FCN). A magnified view of a nurse cell nuclei (NCN) in a 

30 stage 6 egg chamber, showing the largely non-overiapping staining pattem of 
myc-PIWI and DNA. 
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PIWI is cell-autonomouslv required in aermline s tem cells to promote 
their division. The above-presented clonal analyses indicated that piwi 
functions in the apical somatic cells of the germarium to regulate GSC 
maintenance. However, as PIWI is also present in the nuclei of GSCs, the 
5 function of PIWI in GSCs was evaluated by removing PIWI flnom a single stem 
cell using the FLP/FRT-mediated clonal technique (Xu and Rubin, 1 993). piwi^ 
and piwi^ mutations were used for generating piwr deficient clones because 
they are both strong mutations, as described in Examples 1-4. The piwr stem 
cell clones were induced in piwi fRVarmlacZ FRT transheterozygous flies. 

10 The am7/acZ transgene (Lecuit and Cohen, 1997) is expressed in all 

somatic and germline cells in the germarium. piwi deficient clones created by 
a mitotic recombination no longer contain the armlaoZ gene and can thus be 
identified by the absence of the lacZ protein • The control lacZ-negative piwi^ 
gennline clones were induced in piwi^ FRT/armlacZ FRT transheterozygous 

1 5 flies for comparison. Since it takes four days for a cystoblast to develop into 
an egg chamberthat buds off the gemnarium (Margolis and Spradling, (1 995)), 
among p/vw-deficient clones, only GSC clones can persist over more than five 
days (Figure 5B). If no p/wr GSC is observed five days after induced mitotic 
recombination, this would dearly indicate the failure of p/wf stem cells to 

20 maintain themselves. 

Because recombination events can only occur in mitotically active cells 
yet all somatic signaling cells are postmitotic, this approach allows the analysis 
of the cell-autonomous function of p/iv/in single GSCs without the complication 
of generating mutant clones in the signaling ceils. However, even if any 

25 somatic clones are induced, they can be readily identified. Furthemiore, by 
marking individual mutant and wild-type GSCs, ttie relative division rate and 
maintenance ability of each stem cell can be detennined by counting the 
number of its cyst progeny within the gemiarium (also see Xie and Spradling, 
1998). Finally, the cell-autonomous function of piwi in the genmline can be 

30 assessed at all stages of oogenesis by examining the development of the 
Individually marked p/wr gemniine cysts and egg chambers. 
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Mitotic recx)mbination was induced immediately prior to oogenesi$ and 
looked for p/wr GSC clones one week, two weeks, and three weeks following 
oogenesis, piw'r GSCs were present even three weeks following oogenesis, 
consistent with the above-presented observation that PiWI in GSCs is not 
5 required for their maintenance. 

The number of marked piwr and piwi'^ gemiline cysts was then 
examined in tester and control germaria to compare the division rate between 
the piw'r and piwi'^ GSCs within the germarium. For accurate comparison, 
germaria that contain only two GSCs, one mari^ed and one unmarked, were 

10 always examined. In wild-type control germaria, marked wild-type stem cells 
gave rise to approximately 50% of the cysts present within the germarium. This 
indicated that the /acZ-marked and unmarked wild-type stem cells divide at the 
same rate (Figure 5). However, in tester germaria, the number of the marked 
piwr germline cysts is consistently only 25% of mariced wild-type cysts (Figure 

15 5). This analysis reveals that piwt GSCs divided four-fold slower than wild- 
type GSCs. Thus, in addition to its somatic function, PIWI acts cell- 
autonomously in the stem cells to facilitate their division. 

The p/wr cysts and postgemiarial piw'r egg chambers were examined 
by OAPI staining and by Nomarski microscopy for potential developmental 

20 defects. They usually developed normally, suggesting that p/m does not play 
a cell-autonomous role In genmline cyst development and subsequent stages 
of oogenesis. 

Therefore, PIWI acts cell-autonomously in GSCs to promote their 
division. Germaria were labeled with anti-L^cZ antibody for p/iv/^ cells and 

25 anti-IBI antibody forspectrosomesandfusomes. Merited stem cells and cysts 
developed from p/wr germline stem cells (GSCs) are recognized by the 
absence of lacZ expression. In a phv^ tecZFRT/p/wr'" fecZ^ FRT gemnarium 
in the absence of heat shock, no p/wr" /acZFRT/piwf /acZFRT clones were 
induced. GSCs are distinguished by the presence of the spectrosome (Sp) in 

30 direct contact with the terminal filament cells (TFC). 

A pwr lacZfRTfpiwi'^ lacZ* FRT gemnarium was stained one week 
after heat shock, and showed a wild-type GSC and a mutant GSC, as well as 
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three piwr cysts derived from the mutant stem cell. The entire ovariole is 
enveloped by the epithetial sheath (ES), which is different from the inner 
sheath cells (IS). The graph of Figure 5 displays the relative division rates of 
wild-type and p/m^ GSCs. The number of germane counted is given in 
5 parentheses. Only germaria containing one marl<ed and one unmarked GSC 
were examined. 

Overexpression of PIWI increases the number of oermline stem cells. 
Examples 1-4 above establish that the somatic expression of PIWI is essential 
for GSC maintenance. This somatic induction can be achieved via a threshold 

10 mechanism, in which a certain level of somatic PIWI activity is required to 
establish or maintain the stem cell identity, but a higher level of somatic PIWI 
expression will not increase the number of GSCs. Alternatively, the PIWI- 
mediated somatic induction could act via a dosage-dependent mechanism, in 
which the somatic level of PIWI dictates the number of GSCs. 

15 To test these possibilities, PIWI was overexpressed in the somatic cells 

of the adult ovary via heat shock induction using hspTO-GalA {hsGal4) and 
ep(2)1024, an EP element inserted in the 5' UTR of the piwi locus. EP is a 
modified P element carrying an UASt promoter oriented to transcribe the 
flanking genomic sequence upon the activation of the yeast Gal4 protein 

20 (Rorth, (1996)). Specifically, ep(2)1024 in the 5'UTR of piwi is oriented in the 
same direction of piwi transcription so that it can produce a piwi mRNA 
containing the complete PIWI open reading frame. The UAS^ promoter is 
active in the soma but not in the germline (Rorth, 1998), and its somatic 
specrTicity was confinmed by examining the ovarian expression of the UAS^ - 

25 GFP reporter gene under experimental conditions. Because endogenous PIWI 
is expressed in all somatic cells in the germarium, by constructing a Drosophila 
strain that contains /7sGa/4 and ep(2)1024, itwas possible to over-express, but 
not ectopically express, piwi specifically in the somatic cells by heat shock 
induction. 

30 To verify that ep(2)1024 can express a functional PIWI protein, 

ep(2) 1024 was overexpressed by heat shocking hsGal4 in an ep(2)1024/piwi1 
mutant background. Because ep(2)1024 is a strong loss-of-function mutation 
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of piwi, ep(2)1024lpiwi'f; hsGal4 females and males in the absence of heat 
shock display typical piwi phenotype. In contrast the same mutants, after 
receiving daily heat shock treatment from the second instar lan^ae to three 
days after eclosion, produced significantly restored fertility in both 
5 ep(2) 1O24lpiw0 females and males. The morphology of both the ovaloles and 
testes in the heatshocked mutants are indistinguishable from their wild-type 
siblings. In addition, the fertility of the rescued mutants is comparable to their 
wild-type siblings. Heatshock starting at the pupal stage does not rescue the 
fertility. Therefore, ep(2)1024 can produce a functional PIWI protein capable 

10 of rescuing the piwi mutant phenotype when expressed in somatic cells. 

ep(2)1024 was then used to overexpress PIWI in wild-type adult 
females, and its effects on GSCs were analyzed. The females were heat- 
shocked at a 12-hour interval for three days and then dissected. The ovaries 
were stained with anti-VASA antibody (Hay et al., (1990)) to highlight germ 

15 ceils and anti-1B1 antibody (Zaccai and Upshitz, (1996)) to outline somatic 
ceils as well as to label spectrosomes and fusomes that mark individual stages 
of genmline development in the germarium (Lin and Spradling, (1995)). Wild- 
type genmaria typically have 2-3 spectrosome-containing cells, with one often 
being a cystoblast that is not associated with the terminal filament This Is also 

20 the case for uninduced ep(2)1024l'^ germaria. Interestingly, the number of the 
spectrosome-containing genn cells increased to an average of 7.5 cells per 
ep(2)1024 h germarium following heat shock induction. In the most extreme 
case* up to 15 spectrosome-containing cells were observed in a single 
gennarium. This increase is not due to the expresston of hsGal4 alone, since 

25 ovaries from hsGal4 females subjected to the same heat shock regime were 
indistinguishable from wild-type. Thus, overexpression of PIWI in the soma 
leads to a 3-4 fold increase in the number of GSCs and/or cystoblasts. 

Thus, the observation that EP(2)1024 expression rescues the piwi^ 
mutant phenotype was shown via DAPI staining of the gonads from 

30 ep(2) 1024/piwi^\ hsGAL4 females and males either without or with heat shock. 
In the absence of the heat shock, the ovaries and testes display the typical 
piwi^ mutant phenotype. Following /75Ga/4-induced over-expression of the 
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ep(2)1024 in the soma, significant rescue of ttie mutant phenotype was 
observed in ix>th females and males. Specifically, in females, genniine stem 
cells are maintained and a string of developing egg chambers was observed. 
Similarly, in males, gentiline stem cells were also rescued, and 
5 spermatogenesis was fully restored. 

To distinguish whether the spectrosome-containing cells are stem cells 
or cystoblasts. the gennaria were stained with anti-cytoplasmic BAM (BAI\^-C) 
antibody, since BAM-C is only expressed in cystoblasts and early mitotic cysts 
but not in GSCs (McKearin and Ohistein, (1995)). In p/w/-overexpressed 

1 0 gennaria, BAM-C staining is strongly present in early cysts. However. BAM-C 
staining is conspicuously absent from all the spectrosome-containing cells. 
This observation suggests that the ectopicaliy induced spectrosome-containing 
cells are GSC-like cells. 

The observation that somatic PIWI over-expression increases the 

1 5 number of genniine stem cells was supported a confocal z-senes analysis (1 0 
pm stacks) of wild-type germarium immunofluorescently labeled with anti-1 81 
antibody. At the apical tip of region 1 in direct contact with the terminal 
filament, two germline stem ceils containing spectrosomes (Sp) were observed. 
Posterior to germline stem cells, developing germline c^sts containing 

20 branched fusomes (Fu) were observed. A germarium from a ep(2)1024; 
hsGal4 female was observed to display many germline stem-like cells following 
4 days of heat shock treatment with 12 shown in this focal plan as indicated 
by spectrosomes (Sp; stained with anti-1 B1 antibody) and the absence of the 
BAM-C protein. The stenrv-like cells were evenly distributed between regions 

25 1 and 2a. The BAM-C protein was only present in a cyst containing a branched 
fusome in region 2a. 

A gennarium containing ectopic stem-like cells was stained with anti- 
BrdU and anti-a-spectrin antibodies following BrdU incorporation as described 
above in the Materials and Methods. Some mitotically active germline cells 

30 which have incorporated BrdU in their nuclei were observed. A fusome- 
containing germline cyst (Fu) was present in region 1 but not in contact with 
somatic cells. A gennarium from an ep(2} 1024; hsGal4 female subjected to 4 
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days of heat shock treatments followed by 4-5 days of culture at room 
temperature was also stained. The stainings indicated anti-VASA and anti-1 B1 
antibody signals, respectively. This germarium, like a typical wild-type 
genmarium, contained two genmline stem cells contacting the apical somatic 
5 cells and a cystoblast that is one cell away from the tenminal filament. In 
addition, the number of fusome-containing cysts (Fu) have increased to a 
normal level. 

Figure 7 depicts a graphical comparison of the number of spectrosomes 
present per wild-type gemnarium versus per p/w/-overexpressed germarium. 

10 The number of germaria examined (n) is in parentheses. All germaria are 
oriented with anterior toward the left. 

The ectopic GSC-like cells appear to be ftjnctional GSCs. First, they 
incorporate BrdU, an indicator of DNA replication, at a level similar to the wild- 
type GSCs. This suggests that they are not arrested in the cell cycle. Second, 

15 these GSC-like cells can ail differentiate within four days following the 
withdrawal of heat shock; leaving the gemiaria with only 2-3 GSCs at their 
normal locale. No signs of cell death, such as pycnotic nuclei or apoptotic 
bodies, were detected by DAP! staining and Nomarski optics. This suggests 
that the GSC-like cells are capable of oogenesis. Thus, the dependence of 

20 GSC number on the PIWI level reveals that pm^-mediated somatic signaling 
controls the number of GSCs via a dosage dependent mechanism. 

The ectoplcaily induced GSCs show an even distribution pattern along 
the germarial axis. In wild-type genmaria, spectrosome-containing cells are 
exclusively located in the apical area of regton 1, either in direct contact with 

25 or one-cell away from the tenminai filament (for stem cells and cystobiasts, 
respectively). However, following PIWI overexpresston, spectrosome- 
containing cells were evenly expanded along the somatic gemnarial wall to both 
regions 1 and 2a. All the spectrosome-containing ceils are in apparent contact 
with the somatic cells; there is no apparent gradient of the stem-like ceil 

30 distribution towards the terminal filament. In wild-type germaria, inner sheath 
cells express as much PIWI as cap cells, and much more than the main 
tenninal filament cells, yet there are no GSCs located near them in posterior 
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region 1 or region 2a. Therefore, the appearance of the ectopic stem-like cells 
in the entire regions 1 and 2a following p/tvk>verexpression suggests that a 
very high level of PIWI in the inner sheath cells expands the stem cell domain 
to region 2. 

5 Over-expression of PIWI increases the rate of aermline stem cell 

division. Since over-expression of piwi in somatic cells increases the number 
of GSCs, yet loss of p/vw function from the soma and germline in piwi mutants 
abolishes stem cell division and thus their maintenance, one might expect that 
over-expression of p/w/in the soma may also increase the rate of GSC division. 

10 To address this possibility, the mitotic index of GSCs in p/w/-overexpressed 
females was directly quantified. GSC division is unique in that it has an 
unusually long telophase-to-interphase period characterized by the presence 
of a dumb bell-shaped telophase spectrosome spanning the two daughter cells. 
Deng and Lin, (1997); de Cuevas and Spradling, (1998). The morphologically 

15 distinctive telophase spectrosome provides an easy marker for the accurate 
measurement of the telophase/early interphase Index, which is a direct 
quantitation of the frequency of GSC division. 

To avoid a bias in quantifying the division frequency due to the presence 
of an increased number of GSC4ike cells in p/iv/-overexpressed germaria, the 

20 2-3 anterior-most GSCs that contact the terminal filament were quantified, and 
the same number of stem cells in the wild-type and uninduced ep(2)1024 
controls were quantified. In tiie wild-type and uninduced ep(2)1024 controls, 
20-25% of gennaria contained a single GSC witii a telophase spectrosome; 
only 2-3% of germaria contained two GSCs with telophase spectrosomes. 

25 These frequencies are increased to 31% and 13% in pnv/-overexpressed 
germaria. Moreover, 3% of the p/w/-overexpressed germaria contain three 
GSCs with a telophase spectrosome, a situation that was not observed in the 
control germaria. These results indicate that in addition to an increase in the 
number of stem cells, piwi overexpression also increases the rate of GSC 

30 division. Because the GSCs under examination are located in their normal 
niche, the increase in their mitotic frequency should be mostiy, if not all, due to 
the increased PIWI expression in the tenfninal filament. 
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Four micrographic panels of anti-1B1 images of germaria containing 0, 
1, 2, and 3 telophase spectrosome-containing genniine stem cells were 
prepared and examined. The frequency of observing such germaria in the 
wild-type control, the uninduced ep(2) 1024 control, and the p/wk>verexpressed 
5 germaria is presented in Table 5. The number of germaria counted for each 
genotype (n) is given in parentheses. 

Table 5 

Percentage (%) of GSC in Telophase 



Telophase 


W.T. 


EP(-)HS 


EP (+) HS 


GSCs 


fn=100) 




(n=100) 


0 


73 


77 


53 


1 


25 


20 


31 


2 


2 


3 


13 


3 


0 


0 


0 



15 Discussfon 

Stem cells are characterized by two common properties: the capacity for 
self-renewal and the ability to propagate numerous progeny fated forfurither 
differentiation (for review, see Monison et a!., 1997; Lin, 1997). The unique 
self-renewing division of stem cells is controlled by both intracellular and 

20 extrinsic signaling mechanisms (Knoblich, 1 997; Lin, 1 997). Example 5 shows 
that p/vv/ encodes a novel nucleoplasmic protein present in both somatic and 
germiine cells of both sexes. Moreover, the single stem cell clonal analyses 
and piwi overexpression experiments establish the important function of piwi 
in both the intracellular and the cell-cell signaling mechanisms. Example 5 thus 

25 reveals a novel nuclear factor whose activity modulates the number of GSCs 
and the rate of their division. 

The role of PIWI as a nucleoplasmic factor in somatic signaling. The 
immunofluorescence analysis of the fully functional myc-PIWI protein reveals 
that PIWI is a nucleoplasmic protein. Consistent with RNA in situ analyses, 

30 PIWI is present in both the soma and the gemiline during oogenesis. In 
Example 5 the cell-autonomous function of PIWI as a mitotic promoter in GSCs 
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is revealed. PIWI is localized In the nucleoplasm in both the apical somatic 
cells and GSCs. Hence, the nucleoplasm is the likely site of PIWI function, and 
In accordance with one aspect of the present inventbn, PIWI is the first 
nucleoplasmic factor known to play a key role in controlling GSC division. 
5 As a nucleoplasmic factor in the apical somatic cells, PIWI is expressed 

in three types of post-mitotic cells that form a somatic cap surrounding GSCs 
and their immediate daughter cells: the main terminal filament cells, the cap 
cells, and the inner sheath cells. Among these cell types, the main terminal 
filament cells express a very low level of PIWI while cap cells and inner sheath 

10 ceils express a very high level of PIWI. Since cap cells are in direct contact 
with GSCs, it is envisioned that PIWI expression in cap cells plays a role in 
somatic signaling. 

Since PIWI is a nucleoplasmic protein, it is unlikely a somatic signal 
itself, but rather an essential component of the somatic signaling machinery 

1 5 responsible for producing the signal. Given its nucleoplasmic localization, PIWI 
might be involved in post-transcriptional mRNA processing in the nucleus or 
in nuclear functions indirectly related to gene expression. In either case, the 
somatic activity of PIWI acts via a dosage dependent mechanism to control the 
number of GSCs In the germline. Therefore, piwi can help to define a stem cell 

20 niche in the germarium for GSC maintenance, with the size of the niche 
corresponding to the level of PIWI activity. 

A number of genes, including fs(1)Yb (King and Lin, 1999) and dpp (Xie 
and Spradling, 1998), have recently been shown to play roles in the cell-cell 
signaling mechanism in maintaining GSCs. Among these genes, dpp is the 

25 only known signaling molecule. DPP over-expressk>n dramatically expands the 
number of GSCs and prevents their differentiation, leading to tumorous 
gemnaria filled with undifferentiated GSC-iike cells (Xie and Spradling, 1998). 
This overexpression phenotype, in sharp contrast to that of piwi, suggests that 
DPP is not the downstream signal of the p/W/-mediated mechanism, even 

30 though it may be one of the signals. fs(1)Yb, on the other hand, shares a very 
similar loss-of-function phenotype with piwi. These two genes are more likely 
to be involved in the same somatic signaling pathway. 
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The role of PIWI as a nudeoplasmic factor in aemriline stem cells. PIWI 
is expressed dynamically in the germline of the germarium. in region 1 , PIWI 
is present in the nucleus of GSCs at a high level but is sharply down regulated 
in cystoblasts and early mitotic cysts. This expression pattern conrelates welt 
5 with the mitosis-promoting function of PIWI in GSCs as revealed by single stem 
cell clonal analysis. The differential distribution of proteins between the stem 
cell and the cystoblast has also been observed for several other factors known 
to be important for GSC division or differentiation. For example, PUM is 
required for GSC maintenance (Lin and Spradiing, 1997; Forbes and Lehmann, 

10 1998) and is accumulated at a high level in GSCs but a low level in the 
cystoblasts (Forbes and Lehmann, 1998; Paris! and Lin, 1999). On the other 
hand, NOS and BAM-C are required for cystoblast development and are 
localized either preferentially or exclusively in the cystoblast. 

All of these molecules, except for BAM-C, are known to play important 

1 5 roles in other developmental processes (Cox et al., 1 998; Paris! and Lin, 1 999). 
For example, the gemnline expression of PIWI during oogenesis also provides 
PIWI as a maternal component essential for embryogenesis (Cox et al., 1998). 
Therefore, the asymmetric fates between GSCs and cystoblasts might not be 
determined by unique "stem cell factors" or "cystoblast factors". It, instead, 

20 may be created by unique combinations of various regulatory molecules that 
are repeatedly used throughout development. 

This pleiotropic function of stem cell genes is also manifested within the 
stem cell mechanisms, in which p/w/ is involved in t)oth somatic signaling and 
in cell-autonomous mitotic functions, pnv/ thus acts in two distinct cell types 

25 and in two different mechanisms. Although applicant does not desire to be 
bound by any particular mechanism of actbn, it is contemplated ttiat the PIWI 
protein directly or indirectly mediates gene expression in the nucleus either at 
the transcriptional or post-transcriptional level. The different piwi target genes 
or their products in different cell types then lead to the distinctively different 

30 cellular function of p/w/. 
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Example 6 

PIWI is a Pole Plasm Component Required for 
Pole Cell Fonmatlon. Migration and Transcriptional Quiescence 
This Example shows that PIWI is also a matemally deposited polar 
5 granule component that plays a key role in pole cell formation, migration, and 
transcriptional quiescence. However, unlil^e most other genes l<nown to be 
required for pole cell formation, p/Vw is specifically required for pole cell 
formation with no effect on abdominal patterning. PIWI is dynamically 
expressed during genu plasm assembly and pole cell formation. The protein 
10 is first expressed in polar granules at the posterior end of the embryo and is 
subsequently expressed in the nuclei of all primordial germ cells throughout 
embryogenesis. Furthemnore, the piwi gene product directs germ plasm 
assembly and controls the number of pole cells formed at the posterior pole of 
the Drvsophila embryo. These results indicate that piwi has many of the 
1 5 properties of a genm cell-specific detemiinant. 

Materials and Methods 
DrosoDhiia strains and culture. All strains were grown at 25''C on yeast- 
containing molasses/agar medium. The following fly strains were used in this 
study: w; p/w/^, FRT^OA/CyO and w; piwi^ FRT^OA/cyO (as disclosed in 

20 Examples 1-4) were used to generate maternal clones; y, w PjhsFLPJ^^; 
P[ov(P^f^, FRT*0A/cyO (Chou and Perrimon. 1996); P[myc-PIWI] insertions 
on the second and X chromosomes were used to increase the piwi copy 
number as well as to visualize PIWI protein expression; 4X os/carflies bear two 
wild-type copies of os^ar on the third chromosome and two transgenic rescue 

25 copies of osl(ar on the second chromosome (Ephrussi and Lehmann, 1992. 
OregonR (Om-R) senses as the wild-type control in all experiments. 

Immunohistochemistry. Ovaries were dissected, fixed and stained as 
described In Lin ef a/., 1994. Embryos were collected, fixed and stained 
according proceduredescribed by Patelef a/. (1989). For immunofluorescence 

30 staining the following antisera were used: polyclonal anti-VASA antibody 
(1 :2000; Hay ef a/., 1 990); monoclonal antl-MYC epitope antibody (1 :50; Evan 
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etaL, 1985); monoclonal antl-SXL antibody (1:500; Bopp etaL, 1991). All the 
fluorescence-conjugated secondary antibodies were from Jackson 
Immunoresearch Laboratory, Wes^rove, Pennsylvania and were used at 1 :200 
dilution, immunofluorescently labeled samples were also counterstained with 
5 DAPI as described previously (Lin and Spradling, 1993). Micrographs werB 
taken using either a Zeiss AXIOPLAN^ microscope or a Zeiss LSM410™ 
confbcal microscope (Zeiss, Oberkochen, Gemiany) as described herein 
above. 

Maternal clones. To generate embryos maternally depleted for piwi the 
1 0 following fly strains were crossed: y, wP[hsFLP]12; p[ovoD1]2L FRT^OA/cyO 
males were crossed to w; piwi^, FRT^^^/CyO virgin females to produce y,w 
P[hsFLP]12; p[ovoD1]2L FRT^OA/piy^jl ^ FRT^OA progeny. Identical crosses 
were perfomied with w; p/iv/2 FRT'^OA/cyO virgin females. Crosses were 
carried out for two days to produce transheterozygous progeny. After 2 days, 
15 adults were transfen^d to fresh vials. Lan/ae were heat shocked on days 3 
and 4 for one hour in a 37*C water bath to induce mitotic recombination. The 
heat-shocked females with germline clones were crossed to Ore-R males or to 
p/yv/2 mutant males. Embryos collected ftiom these females were subsequently 
analyzed by immunofluorescence microscopy. Due to the prosence of a copy 
20 of the ovcP^ gene in these females, only piw'r genmltne clones can give rise 
to a mature egg. 

Pole cell fo miation is dosaoe-dependent for p/wi. Single copy embryos 
are from piwiVCyO females and two copy embryos for botti piwi and os/rar are 
from Ore-R females. For the four-copy piwi embryos, a PImyo-PIWI] roscue 

25 transgene was homozygoused on a wild-type second chromosome; two such 
lines were examined for their effects on embryonic development. For the four- 
copy os/f embryos, an osic rescue transgene was homozygoused on the 
second chromosome of an othenA^ise wild-type fly as described by Ephrussi 
andLehmann, 1992. For the six-copy p/w/ embryos, P[myc-PIWI]transgenes 

30 on the X chromosome and second chromosome were homozygoused to 
generate four transgenic copies in addition to the two wild-type copies on the 
second chromosome. For matemally depleted embryos, germline clones were 
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produced as described above. Embryos from females carrying germline clones 
were collected and analyzed to examine the role of piwi in gennline 
development. For pole cell counts, all embryos were stained with anti-VASA 
antibody and the pole cells of 50 embryos of each matemal genotype, 
5 described above, were counted in triplicate for each embryo. The pole cell 
count reflects the initial number of pole cells piBsent shortly after budding at the 
posterior pole and prior to any amplification or migration. Embryo staging was 
peribnned as described In Zaiokar and Erk (1976) according to the number of 
nuclei present. 

10 Results 

PIWI encodes a polar granule component with a dynamic behavior 
during pole cell formation. As disclosed above, piwi is maternally required for 
embryogenesis. To examine piwi function in embryogenesis. PIWI protein 
expression in embryos was analyzed using a myc-tagged piwi transgene. 

15 Embryos were collected from females canying a P[myc-PIWI] transgene and 
subsequently stained these embryos with antibodies against MYC to visualize 
P[myc-PIWI] protein expression and VASA. a polar granule component and 
pole cell-specific mariner. Embryos were counterstalned with DAPI to visualize 
nuclei and allow a stage-specific analysis of PIWI expression during 

20 embryogenesis (Zaiokar and Eric, 1976). 

PIWI is first expressed in a crescent at the posterior pole of the embryo 
during mitotic cycles 1-7, where it is co-localized with VASA during these 
cydes, including localization to polar granules of which VASA is a component. 
Thus, piwi encodes a polar granule component. During mitotic cycles 8-10, 

25 PIWI Is dynamically expressed in the fomning pole cells. At this stage in 
embryogenesis, nuclei have migrated from the center of the embryo and some 
of them enter the posterior pole plasm. These posterior nuclei become the first 
to cellularize as they bud off from the embryo to form pole cells. At the onset 
of pole cell formation, during mitotic cycles 8 and 9, PIWI is expressed 

30 cytoplasmically in an overiapping pattern with VASA in polar buds. However, 
at mitotic cycle 10 when pole cell fomiation and budding is complete PIWI 
completely translocates to the nucleus of ail pole cells. PIWI is also zygotically 
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expressed in somatic cells at the stage. PIWI remains nuclear in pole cells 
throughout germ cell migration and finally gonad coalescence. 

Maternal PIWI is required for pole cell formation. To analyze the 
potential requirement of p/iv/ in embryogenesis embryos matemaliy depleted 
5 for piwi were generated. Females with piwi^ gemiline stem cell clones were 
crossed to wild-type males and embryos Irom these females were collected. 
To assay germline formation and development, germ cells were visualized by 
staining with an anti-VASA antibody. In the absence of maternal piwi product, 
a dramatic reduction in the number of pole cells formed was observed. 

10 Maternally depleted piwi embryos contain on average 7.5 ± 5 pole cells at 
mitotic cycle 10, the pole cell fonnation stage, in contrast to the 24 ± 5 pole 
cells formed in wild-type embryos at the same stage. These results indicate 
that maternal piwi is required for pole cell formation as a wild-type paternal 
copy of piwf^ is insufficient to rescue the pole cell fonnation defect. 

1 5 Following pole cell formation there are typically one to two amplificatory 

mitoses in the pole cell population such that 30-40 pole cells are present prior 
to germ cell migration. There is no additional amplification of the pole cell 
population once gemi cell migration has commenced (Williamson and 
Lehmann, 1 996). To detemnine whether piwi is required for the amplification 

20 of the pole cell population, embryos lacking maternal p/vw for the number of 
pole cells present before germ cell migration and after gonad coalescence 
were examined. There is no amplification of germ cells in the absence of 
maternal piwi; rather, a small decrease in the average number of germ cells 
present at gonad coalescence was observed. Prior to gemri cell migration p/wr 

25 embryos contained 7.5 ± 5 genu cells, however at gemiline coalescence the 
average number of genu cells dropped to 4.5 ± 3 genm cells. Thus, in addition 
to its requirement for pole cell fomnation, matemaliy deposited piwi also plays 
an role pole cell amplification and survival. 

The maternal effect of piwi on pole cell formation is similar to that 

30 observed with the posterior group genes which are also required for pole plasm 
assembly and pole cell fomnation. To examine whether matemaliy depleted 
piwi embryos have any abdominal defects, Nomarski optics were used to 
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examine the segmentation pattern of these embryos. There is no readily 
apparent defect in abdominal patterning. Thus, in contrast to the posterior 
group genes, piwi does not appear to be required for proper abdominal 
patteming of the embryo. 
5 Matemal PIWI is required f or pole cell migration. To analyze the effect 

of removing matemal piwi function on gernn cell migration, embryos from 
females bearing piwi mutant gemiline stem cell clones were collected and 
stained forthe genm cell specific maricer VASA. In addition to its role for proper 
pole cell formation and proliferation, p/iv/ activity also plays a role in successful 

1 0 genu cell migration. During pole cell migration in wild-type embryos, gerni cells 
exit the posterior midgut and then actively migrate through the endodemi along 
its basal surface until making contact with the overiying mesodermal cells 
(Wanior, 1994). At late stage 10 to early stage 1 1 (for staging, see Zaiokar 
and Erk, 1976), gemi cells transfer from the endoderm into the mesodemi 

1 5 where they will associate with somatic gonadal precursor cells which give rise 
to the gonadal mesodenrn. Finally, at stage 14 of embryogenesis, gemi cells 
coalesce with somatic gonadal precursors in parasegment 10 to establish the 
primordial gonad. 

However, in embryos lacking matemal PIWI, defects in gemi cell 
20 migration are seen from stage 10 onwards which marks the active phase of 
gemi cell migration. As germ cells exit from the posterior midgut pocket, they 
fail to undergo directional migration over the surface of the endodenm. Rather, 
they are found migrating in a random fashion to many different sites within the 
embryo. In some cases highly abenrant gemi cell migration into the anterior 
25 end of the embryo was observed. The random pattems of germ cell migration 
in these mutant embryos indicates a requirement for matemal PIWI in germ cell 
guidance. In addition to the defect in gemi cell migration, defects in overall 
pole cell morphology were observed. In contrast to wild-type pole cells, pole 
cells maternally depleted of piwi are of variable size and shape and display 
30 unusual blebbing and aberrant pseudopodia. This indicates that matemal piwi 
also plays a role in proper pole cell morphology which might be essential for 
proper pole cell migration. 
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Previously, it has been demonstrated that signals produced by 
surrounding somatic tissues act to guide germ cells by both attractive and 
repulsive forces (Jaglarz and Howard, 1994; 1995; Zhang et aL, 1997; Van 
Doren et aL, 1998). However, few genes have been identified that control 
5 gemi cell migration and are expressed in genm cells themselves. The 
expression of piwi in pole cells during migration suggests that p/iy/ function is 
cell-autonomous. This is similar to nanos and pgc-1 both of which are 
expressed in gemi cells and are required for normal germ cell migration 
(Forbes and Lehmann, 1998;Kobayashiefa/., 1996). These data indicate that 
10 gemi cell migration is controlled by both instructive signaling mechanisms as 
well as by cell-autonomous mechanisms within migrating germ cells. 

Maternal PIWI plavs a role in the transcriptional quiescence of pole cells. 
Transcriptional repression in gemi cells has been implicated as a key 
mechanism in the establishment and maintenance of the germ cell fate (for 

1 5 review, see Seydoux and Stnome, 1 999). In Drosophila, zygotic transcription 
in the soma is detectable as eariy as one hour after egg laying (AEL; Pritchard 
and Schubiger, 1 996). In the gemiline, however, general RNA synthesis is not 
detected until 3.5 hours AEL at embryonic stage 8 (Zaiokar, 1 976) and poly (A) 
containing mRNA transcripts are not obsen/ed in eariy germ cell nuclei (Lamb 

20 and Laird, 1976). Few genes have been shown to be specifically transcribed 
In pole cells and among the known genes, vasa is the eariiest gene at 
embryonic stage 9-10 (Hay ef a/., 1988b). Transcription of all other genes to 
be expressed in pole cells is not observed until the primordial gonad reinitiates 
proliferation at embryonic stage 16-18 (Mevel-Ninio etaL, 1 995). Transcription 

25 is reinitiated In pole cells just before they migrate through the midgut epithelium 
approximately 3,5 hours AEL (Zaiokar, 1976). 

To analyze the potential role oipiwi in the regulation of transcriptional 
quiescence in eariy germ cells, embryos maternally depleted of piwi were 
double-labeled with VASA, a germ cell specific marker, and SXL, a somatic 
30 sex-specific marker which is nomnaliy absent in pole cells. Ectopic expression 
of SXL protein in pole cells prior to embryonic stage 9 was observed. This 
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indicates that the mechanism which nomially maintains transcriptional 
quiescence in pole cells is dependent on proper p/Vw function. 

p/iv/Dose Detenmines the Initial Number of Pole Cells. Given the critical 
role of pm in pole cell formation and development, the role of piwi in germ 
5 plasm assembly was further evaluated by examining the relationship between 
piwi gene dosage and the number of pole cells fomied, which is also a 
reflection of the amount of pole plasm formed at the posterior end of the egg. 
The piwi copy number was increased in females from one to four and collected 
embryos from these females to examine the number of pole cells fomied. In 

10 identically staged samples, embryos with one or two copies of the piwi gene 
contained on average 24 ± 5 pole cells. However, a two-fold increase in piwi 
gene dosage over wild-type levels yields embryos containing on average 50 ± 
7 pole cells at the same developmental stage. This phenotype is strikingly 
similar to the phenotype displayed when os/car gene dosage is increased from 

15 one to four copies (Ephoissi and Lehmann, 1992), Like osl<ar, a two-fold 
increase in piwi gene dosage leads to an increase in both the amounts of 
known pole plasm components, such as VASA, at the posterior pole of the 
embryo and the number of pole cells fonned. However, in contrast to oskar, 
increasing piwi gene dosage does not cause defects in abdominal patterning 

20 or polarity (Ephrussi and Lehmann, 1992). In support of the apparent linear 
effect of piwi gene dosage on the number of pole cells formed, we found that 
a three-fold increase in piwi gene dosage results in embryos containing on 
average 66 ±7.5 pole cells, representing a three-fold increase over the number 
of pole cells in wild-type embryos at the same developmental stage. 

25 Discussion 

piwi Diavs a role in embrvonic oermline determination, piwi encodes a 
polar granule component which is maternally required for pole cell formation. 
However, unlike other posterior group genes which are also required for 
abdominal patterning, piwi is specifically involved in pole cell formation. This 
30 Example indicates that, unlike the posterior group genes, piwi does not appear 
to be required for the proper localization of the abdominal detemninant, nanos. 

At the molecular level, germ plasm assembly occurs in a stepwise 
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ordered fashion. The presence of the VASA protein In the small population of 
pole cells fbrnied in embryos lacking matemal piwi indicates that piw must act 
downstream of vasa and therefore oster in pole cell fonnation (for review, see 
Rongo and Lehmann, 1996). In this hierarchy of gene activity, oskar is 
5 required for the localization of vasa and tudorto the pole plasm. In addition, 
all three gene products are polar granule components which like piwi are 
required for pole cell fomiation. However, p/vw has a role in pole cell fomiatlon 
which is downstream of the requirements for germ plasm assembly. This 
requirement is very similar to that reported for other putative germ cell 

10 detenninants including, germ ce/Wess (gel) (Jongens et al, 1992), polar 
granule compor\ent-1 {pgc-l) (Nakamura et ai, 1996), and mtir RNA 
(Kobayashi et aL, 1993). All of these genes act downstream of tud and are 
required for pole cell fomiation, but not for abdominal segmentation. These 
results suggest piwi can act as a maternally deposited gemi cell detemilnant 

1 5 downstream of tud as well. 

Matemal P IWI level detemfiines the initial number of pole cells. A direct 
relationship was obsen/ed between the dosage of the piwi gene and the 
concentration of other pole plasm components, such as VASA, at the posterior 
pole of tiie embryo. This phenotype is highly similar to that observed when 

20 os/farcopy number is increased in embryos (Ephrussi and Lehmann, 1992). 
For both piwi and osl<ar, an increase in gene copy number leads to an 
accumulation of pole plasm components which is independent of tiie dosage 
of other component genes. Therefore, both piwi and oslcar can modulate the 
number of pole cells fomned through a gene dosage effect via regulating tiie 

25 amount of pole plasm. Pole cell fomiation involves a hierarchy of gene 
expression. It has been proposed that oslcar acts as a limiting factor for all 
aspects of pole plasm function (Ephrussi and Lehmann, 1992). The highly 
similar phenotypes obsen^ed in embryos carrying four copies of either piwi or 
osl(ar, suggest that osk is not the only limiting factor for gemi cell fomiation and 

30 that piwi might be a downstream target of oskar in detennining pole cell 
fomiation. The presence of VASA protein in germ cells of embryos maternally 
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depleted for piwi also suggests that piwi Is required downstream of oskartor 
pole cell formation (Ephrussi and Lehmann, 1992). 

As disclosed herein, piwi is a member of a large class of evoluttonarily 
conserved genes. Intriguingly, piwi 'is related to aubergine, a Drosophila gene 
5 required to promote os/car translation (Wilson ef a/., 1996). piwi, however, is 
not likely to promote os/fartranslatlon since p/iy/-depleted embryos do not show 
os/c-like defects. Altematively, piwi may act downstream of osl<an but in a 
completely os/car-independent fashion. 

Previous analyses have revealed that pole plasm assembly is a two-step 
10 process beginning in oogenesis. The first step is the localization of oskar 
mRNA to the posterior pole of the oocyte. The second step is the recruitment 
of vasa and tudor by os/car. Both vasa and tudor are required for pole plasm 
assembly. Downstream of tudorthere is a bifurcation of the hierarchy leading 
to gemi cell detenmination and the proper localization of the abdominal 
15 detentiinant nanos, respectively. In the germ cell detemiinatlon branch are 
potential germ cell detemnlnante which are required for pole plasm assembly 
and pole cell fonmation, but are not required for at>dominal patterning. The 
data presented in the Examples suggests that piwi is likely to act downstream 
of osicar in the genu cell determination branch as a genm cell determinant. 

20 piwFs potential role as a gemri cell detemiinant can be further addressed 

by ectopically expressing the piwi mRNA at the anterior end of the embryo to 
examine whether it can induce pole cell formation. This approach has used in 
identifying molecules crucial to genu cell determination (Ephrussi and 
Lehmann, 1992; Jongens ef a/., 1994). 

25 Maternal PIWI is required for migration and transcriptional quiescence 

of oerm cells. In addition to initial pole cell formation, p/w/ activity is required for 
successful germ cell migration. Maternal PIWI is nuclearly localized to all pole 
cells as they form and remains nuclear throughout genn cell migration and 
primordial gonad fonnation. In the absence of maternal PIWI, gemn cells 

30 migrate in a highly randomized fashion with few germ cells actually reaching 
the gonadal mesodemn. Recent studies have identified several genes which 
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regulate germ cell migration (Moore etaL, 1998; Zhang etaL, 1996), however, 
unlike piwi\ all of these genes appear to be required in the soma and none of 
these mutants result in such randomized migration of germ cells as that 
observed in piwi mutant germ cells. In addition to p/iv/, matemal NOS has also 
5 been shown to act cell-autonomously in germ ceils to facilitate their migration 
(Forbes and Lehmann, 1 998). Somatic signaling has been shown to exert both 
attractive and repulsive forces on genn ceils in order to guide their migration 
(Zhang etal., 1996; Van Doren etaL, 1998a). The abenrant germ ceil migration 
observed in embryos iaclcing matemal PIWI could result from a failure to 
10 respond to these somatic guidance cues perhaps due to the absence of 
specific receptors for the cues on the surface of gemi cells. In any case, these 
results indicate that soma-germline signaling is essential for proper germ cell 
migration and that maternally supplied products, such as PIWI and NOS, are 
required for successful germ cell migration. 

1 5 The small decrease in the average number of germ cells after migration 

was initiated in p/\v/-depleted embryos indicates that matemal PIWI also plays 
a role In germ cell survival. Both in vitro and in vivo studies suggest that the 
mammalian stem cell genes ]Miite (W) and Steel {SI) also function in the 
survival of primordial genn cells (Gomperts et aL, 1 994). SI mutants are sterile 

20 due to poor genn ceil survival and improper migration (Witi:e, 1 990). Therefore, 
PIWI and SL play a similar role in the proliferation and migration of primordial 
germ cells. However, the SL function is zygotically required, just as nos and 
ovo in Drosoptiila, piwi is the first clearly demonstrated case where a 
matemally provided product functions in the survival of pole cells during 

25 embryogenesis. 

This Example further reveal that matemal PIWI is required for 
transcriptional repression in eariy germ cells. Transcriptional repression has 
been implicated as an essential mechanism in the establishment and 
maintenance of the genn cell fate (for review, see Seydoux and Strome, 1 999). 
30 Previous studies indicate that the repression of RNAP II activity in eariy gerni 
cells may be part of an evolutionarily consen/ed mechanism that distinguishes 
the gennline from the soma during eariy embryogenesis (Seydoux and Dunn, 
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1997). Germ cells in both Drosophila melanogasterand C. elegans have been 
shown to lack a specific phosphoepltope of RNAP II until gemi cells associate 
with the gut primprdium during gastrulatlon (Seydoux and Dunn, 1997). The 
phosphorylation of RNAP II is a prerequisite for its transcriptional activity. 
5 Thus, the absence of the phosphoepltope suggests that RNAP 11 activity is 
inhibited in eariy embryonic gemri cells (Seydoux and Dunn, 1997; Van Doien 
etaL. 1998). 

While applicant does not wish to be bound by a particular mechanism 
of action, piwi could function to repress transcription in eariy germ cells by 

10 regulating the expression of this specific phosphoepitope of RNAP II. In C. 
elegans, the pie-1 gene product has been shown to function in transcriptional 
repression of the genu cell lineage (Mello et ai, 1992). In Drosophila, piwi 
represents the first gene implicated in regulating transcriptional quiescence In 
germ cells. Prior to zygotic gene expression in the Drosopliila embryo, genu 

15 cells undergo a wide range of morphological and cellular behaviors including 
precocious cellularization, translational regulation (Dalby and Glover, 1993), 
germ cell-specific cell cycle regulation (Sonnenblick, 1941) as well as the 
development of a migratory cellular morphology (Jaglarz and Howard. 1995). 

Therefore, the genu plasm, which is maternally provided, must contain 
20 a combination of factors which controls these processes in eariy germ cells. 
Maternally provided factors in the gemn plasm must function to repress gemn 
cell gene expression as well as to activate specific gene expression in 
primordial gemi cells at the appropriate time. Thus, the mechanisms for timing 
the onset of zygotic gene expression in the genmline should be matemally 
25 provided as cell-autonomous factors to germ ceils. PIWI is envisioned to 
represent such a factor to maintain transcriptional quiescence until the 
appropriate developmental timepoint. The loss of transcriptional repression in 
germ cells lacking maternal PIWI could result in the loss of nomial germ cell 
properties due to differentiation, which could account for the defective pole cell 
30 migration. 

PIWI as a factor for maintaining totipotentcv in germ cells. Primordial 
germ cells are the only population of cells in the organism that retain totipotent 
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potential. Based on studies of genes such as pie-1, transcriptional repression 
has emerged as a Icey mechanism that prevents gemi cells from expressing 
genes that would cause them to enter other lineages and diffier^ntiate (Mello 
etal., 1992; for review, see Seydoux and Strome, 1999). p/vw is essential for 
5 the maintenance and proliferation of genmiine stem cells in both males and 
females. Gemnline stem cells, like primordial gemn cells, should be pluripotent; 
tiie maintenance of their identity is vital for the propagation of the species. 
Furthermore. PIWI is specifically expressed in all pole cell nuclei and is 
required for proper gemrj cell fomiation and transcriptional repression. 
10 Particularly, piwi gene dosage is directly related to the number of pole cells 
fomied. These data, taken together, suggest that piwi has many of the 
essential properties required for establishing and maintaining pluripotentcy 
during germline development 
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It will be understood that various details of the invention may be 
changed without departing from the scope of the Invention. Furtherniore, the 
foregoing description is for the purpose of illustration only, and not for the 
purpose of limitation-the invention being defined by the claims. 
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CLAIMS 

What is claimed is: 

1 . An isolated and purified biologically active p/iv/ family polypeptide 
comprising: 

5 (a) a polypeptide encoded by a nucleic acid sequence as set forth 

In any of SEQ ID N0s:1 , 3 and 5; 

(b) a polypeptide encoded by a nucleic acid having at least about 
75% or greater homology to a DNA sequence as set forth in any 
ofSEQIDNOs:1.3and5; 

10 (c) a polypeptide encoded by a nucleic acid capable of hybridizing 

under stringent conditions to a nucleic acid conrtprising a 
sequence or the complement of a sequence as set forth fn any 
ofSEQIDNOs:1.3and 5; 

(d) a polypeptide which is a biologically functional equivalent of a 
1 5 peptide as set forth in any of SEQ ID N0s:2. 4 and 6; 

(e) a polypeptide which is immunoiogtcally cross-reactive with 
antibodies which are immunologically reactive with peptides 
encoded by the nucleic acid sequences as set forth in any of 
SEQ ID N0s:2, 4 and 6; 

20 (f) a polypeptide encoded by a nucleic acid molecule capable of 

hybridizing to a nucleic acid molecule having the sequence of 
any of SEQ ID N0s:1, 3 and 5, or fragments or variants or 
complementary sequences thereof, under high stringency 
conditions; or 

25 (g) a polypeptide comprising a portion of a polypeptide of a), b), c). 

d). e). orf). 

2. The polypeptide of claim 1, wherein the polypeptide comprises 
a vertebrate p/iv/ family polypeptide. 

3. The polypeptide of claim 2, wherein the polypeptide comprises 
30 a human piwi family polypeptide. 
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4. The polypeptide of daim 1 , wherein the p/iv/ family comprises an 
amino acid sequence as set forth in any of SEQ ID N0s:2, 4 and 6. 

5. The polypeptide of claim 1 , modified to be in detectably labeled 

form. 

5 6. An isolated and purified antibody capable of specifically binding 

to the polypeptide of claim 1. 

7. The antibody of claim 6 which is a monoclonal antibody. 

8. The antibody of claim 6 which is a polyclonal antibody. 

9. A hybridoma cell line which produces the monoclonal antibody of 
10 claims. 

10. An isolated and purified antibody capable of neutralizing the 
biological activity of the polypeptide of claim 1. 

1 1 . The antibody of claim 1 0 which is a monoclonal antibody. 

12. The antibody of claim 10 which is a polyclonal antibody. 

15 1 3. A hybridoma cell line which produces the monoclonal antibody of 

claim 11. 

14. An isolated and purified polynucleic acid encoding a biologically 
active p/iv/ family polypeptide of claim 1. 

15. The polynucleic acid of daim 14, wherein the encoded 
20 polypeptide comprises a vertebrate piwi family polypeptide. 

16. The polynucleic acid of claim 15, wherein the encoded 
polypeptide comprises a mammalian p/vw family polypeptide. 

17. The polynudeic acid of daim 16, wherein the encoded 
polypeptide comprises a human p/iv/ family polypeptide. 

25 18. The polynudeic acid of daim 14, wherein the encoded 

polypeptide comprises an amino acid sequence as set forth in any of SEQ ID 
N0s:2. 4 and 6. 
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19. The polynucleic acid of claim 18, further comprising a piw 
lamlly-encoding polynucleic acid sequence as set forth In any of SEQ ID 
N0s:1,3and 5. 

20. The polynucleic add of claim 14, further defined as a DNA 
5 segment. 

21 . The polynucleic acid of claim 14, further defined as positioned 
under the control of a promoter. 

22. The polynucleic acid of daim 14, ftjrther defined as a recombinant 

vector. 

10 23. The polynudeic acid of claim 22, wherein the vector Is a 

recombinant expression vector. 

24. The polynucleic add of daim 22, further defined as a nucleic acid 
fragment of up to 50,000 basepairs in length. 

25. The polynudeic add of claim 24, further defined as comprising 
15 at least a 1,000 nudeotlde long contiguous stretch of a polynudeic add 

sequence as essentially set forth in any of SEQ ID N0s:1 , 3 and 5. 

26. A recombinant host cell comprising the polynudeic acid of claim 

14. 

27. The recombinant host ceil of daim 26, wherein the host cell is a 
20 procaryotic cell. 

28. The recombinant host cell of daim 26, wherein the host cell is a 
eukaryotic cell. 

29. A method of preparing a piwi family polypeptide, comprising: 
transforming a cell with the polynudeic acid of daim 14 to produce a p/iv/famlly 

25 under conditions suitable for the expression of said polypeptide. 

30. A method of detecting in a sample an RN A that encodes the piwi 
family polypeptide encoded by the nudeic acid of daim 14. said method 
comprising the steps of: 

(a) contacting said sample under hybridizing conditions with the 
30 polynucleic acid of claim 14 to form a duplex; and 
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(b) detecting the presence of said duplex. 

31 . A metliod of producing an antibody immunoreactive with a piwi 
family polypeptide, the method comprising steps of: 

(a) transfecting a recombinant host cell with the a polynucleic acid 
5 of claim 14, which encodes a piwi family polypeptide; 

(b) culturing the host cell under conditions sufficient for expression 
of the polypeptide; 

(c) recovering the polypeptide; and 

(d) preparing the antibody to the polypeptide. 

10 32, The method of claim 31, wherein the polypeptide comprises a 

polypeptide as essentially set forth in any of SEQ ID N0s:2, 4 and 6. 

33. The method of claim 31 , wherein the polynucleic acid comprises 
a polynucleic acid sequence as essentially set forth in any of SEQ ID NOs:1 , 
3 and 5. 

15 34. An antibody produced by the method of claim 31 . 

35. A method of detecting a piwi family polypeptide, the method 
comprising immunoreacting the polypeptide with an antibody prepared 
according the method of claim 31 to fonii an antibody-polypeptide conjugate; 
and detecting the conjugate. 

20 36. A method of detecting a messenger RNA transcript that encodes 

a p/W/ family polypeptide, the method comprising the steps of hybridizing the 
messenger RNA transcript with the polynucleic acid of claim 14 to form a 
duplex; and detecting the duplex. 

37. A method of detecting a DNA molecule that encodes a p/iv/fomily 
25 polypeptide, the method comprising the steps of hybridizing DNA molecules 

with the polynucleic acid of claim 14 tofomi a duplex; and detecting the duplex. 

38. An assay kit for detecting the presence of a piwi family 
polypeptide in a biological sample, the kit comprising a first container 
containing a first antibody capable of immunoreacting with a piwi family 
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polypeptlde of claim 1, wherein the first antibody is present in an anrount 
sufficient to perfomn at least one assay. 

39. The assay kit of claim 38, further comprising a second container 
containing a second antibody that immunoreacts with the first antibody. 

40. The assay kit of claim 39, wherein the first antibody and the 
second antibody comprise monoclonal antibodies. 

41 . The assay kit of daim 39, wherein the first antibody is affixed to 
a solid support. 

42. Theassaykitofdaim39, wherein the firstand second antibodies 
each comprise an Indicator. 

43. The assay kit of claim 42, wherein the indicator is a radioactive 
label, a fluorescent label or an enzyme. 

44. An assay kit for detecting the presence, in biological samples, of 
a p/w/ family polypeptide, the kit comprising a first container that contains a 
polynucieic acid identical or complimentary to a segment of at least ten 
contiguous nucleotide bases of the polynucieic acid of claim 14. 

45. An assay kit for detecting the presence, in a biological sample, 
of an antibody immunoreactive with a p/w? family polypeptide, the kit comprising 
a first container containing a piwi family polypeptide of claim 1 that 
immunoreacts with the antibody, with the polypeptide present in an amount 
sufficient to perform at least one assay. 

46. A method of screening candidate substances for an ability to 
modulate p/w/ family biological activity, the method comprising the steps of: 

(a) establishing replicate test and control samples that 
comprise a biologically active p/w/ family polypeptide; 

(b) administering a candidate substance to the test sample 
but not the control sample; 

(c) measuring p/w/family biological activity in the test and the 
control samples; and 
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(d) determining that tlie candidate substance modulates piwi 
family biological activity if the p/iv/ family biological activity 
measured for the test sample is greater or less than the 
piwi family biological activity measured for the control 
5 sample. 

47. The method of claim 46, wherein the candidate substance is 
further characterized as a candidate polypeptide, and further comprising the 
step of purifying and isolating a gene encoding the candidate polypeptide. 

48. The method of claim 46, wherein the replicate test and control 
1 0 samples further comprise a cell that expresses a biologically active vertebrate 

p/w/ family polypeptide. 

49. A recombinant cell line suitable for use in the method of daim 46. 

50. A method of nrx>dulating piwi family polypeptide activity in a 
subject, the method comprising the step of administering to the subject an 

15 effective amount of a substance capable of modulating the piwi family 
polypeptide activity In the subject, whereby modulation of the piwi family 
polypeptide activity is accomplished. 

51 . The method of claim 50, wherein the step of administering further 
comprises administering an effective amount of a substance that modulates 

20 expression of a piwi family-encoding polynucleic acid in the subject. 

52. The method of claim 51 , wherein the substance that modulates 
expression of a p/w/ family-encoding polynucleic acid comprises an antisense 
oligonulceotide. 

53. The method of claim 50. where the substance that modulates the 
25 piwi family activity comprises an anti-p/vw family antibody. 

54. The method of claim 53. where the anti-p/w/ family antibody 
comprises a monoclonal antibody. 

55. The method of claim 50, wherein the p/vwiamily activity comprises 
modulating cell growth, proliferation, division and combinations thereof, and 
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wherein the step of administering comprises administering to the subject an 
effective p/iv/family-modulating amount of a p/iv/ family-modulating substance. 

56. The method of claim 50, wherein the subject is a vertebrate. 

57. The method of claim 56, wherein the vertebrate is a warm- 
5 blooded vertebrate. 

58. A method of treating a subject suffering from a disorder 
associated with piwi family biological activity, the method comprising: 
administering to the subject an effective amount of a piwi family biological 
activity-modulating substance, whereby treatment of the disorder is 

10 accomplished. 

59. The method of claim 58, wherein the step of administering further 
comprises administering an effective amount of a substance that modulates 
expression of a p/wi family-encoding polynucleic acid in the subject 

60. The method of claim 59, wherein the substance that modulates 
1 5 expression of a piwi family-encoding polynucleic acid comprises an antisense 

oligonulceotide. 

61. The method of claim 58, wherein the substance capable of 
modulating piwi family activity in the subject comprises an anthpiwi family 
antibody. 

20 62. The method of claim 61, wherein the anti-p/w/ family antibody 

comprises a monoclonal antibody. 

63, A method of treating a subject suffering from a disorder 
associated with p/w/-mediated biological activity, the method comprising 
administering to the subject a therapeutic composition which comprises a 

25 biologically active piwi family polypeptide, whereby treatment of disorder 
associated with p/W-mediated biological activity in the subject is accomplished. 

64. The method of claim 63, wherein the therapeutic composition 
comprises a biologically active p/w/ family polypeptide and a phamiaceuticaily 
acceptable canier. 
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65. The method of claim 63, wherein the p/ivf-mediated biological 
activity comprises modulation of stem cell growth, proliferation, division and 
combinations thereof, modulation of primordial genn cell proliferation, and 
combinations thereof; and the piwi family polypeptide is administered in an 
5 effective amount. 

65. A method for modulating p/V^mediated biological activity in a cell, 
the method comprising: 

(a) delivering to the cell an effective amount of a DNA molecule 
comprising a polynucleotide that encodes a biologically active 

1 0 piwi family polypeptide; and 

(b) maintaining the cell under conditions sufficient for expression of 
said polypeptide. 

66. The method of claim 65, wherein the polypeptide comprises a 
polypeptide as essentially set forth in any of SEQ ID NOs:2, 4 and 6. 

1 5 67. The method of claim 66, wherein the polypeptide is encoded by 

a poiynucleic acid as essentially set forth in any of SEQ ID N0s:1 , 3 and 5. 

68. A method of culturing a primitive cell, the method comprising: 

(a) providing a culture comprising a primitive cell; and 

(b) delivering to the primitive cell a piwi family gene product, 
20 whereby growth of the primitive cell in a culture is accomplished. 

69. The method of claim 68, wherein the primitive cell is maintained 
in the culture for a time sufficient to produce a sustained cell culture. 

70. The method of claim 68, wherein the piwi family gene product is 
delivered to the primitive cell by delivering to the primitive cell an effective 

25 amount of a polynucleotide that encodes a biologically active piwi family 
polypeptide; and maintaining the primitive cell under conditions sufficient for 
expression of the polypeptide. 

71 . The method of claim 68, wherein the p/\v/ family gene product is 
delivered to the cell by administration of a biologically active piwi family 

30 polypeptide to the culture. 
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72. 



The method of daim 68, further comprising: 
collecting primitive cells; 



(a) 



(b) 



depositing the primitive cells in contact with a feeder matrix, the 
feeder matrix comprising a p/iv/ family gene product; and 



5 



(c) 



culturing the primitive cells on the feeder matrix in the presence 
of media for a time sufficient to produce a sustained primitive cell 
culture* 



73. The method of claim 68, wherein the feeder matrix comprises a 
recombinant cell comprising a polynucleic acid encoding a biologically active 

1 0 piwi family polypeptide. 

74. The method of claim 68, further comprising establishing a 
monolayer of cells in the feeder matrix. 

75. A transgenic or chimeric non-human animal having incorporated 
into its genome a polynucleic acid encoding a biologically active piwi family 

15 polypeptide, the polynucleic acid being present in said genome in a copy 
number effective to confer expression In the animal of the pm family 
polypeptide. 

76. The trangenic or chimeric non-human animal of claim 75, wherein 
said polynucleic acid comprises a human p/iv/ family-encoding segment. 

20 77. The transgenic or chimeric non-human animal of claim 75, 

wherein the expression of the piwi family polypeptide is conferred in 
reproductive tissue of the animal. 

78. The transgenic or chimeric non-human animal of claim 75, 
wherein the expression of the piwi family polypeptide confers an increased 

25 number of germline stem cells or primordial gemn cells in the animal as 
compared to a nonmal animal. 

79. The transgenicor chimeric non-human animal of daim 75, further 
comprising an additional polynudeicacid encoding a predetennlned biologically 
active polypeptide, the polynucleic acid being present in said genome in a copy 

30 number effective to confer expression in the animal of the polypeptide. 
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80. A method of altering the phenotype of an embryonic animal, the 
method comprising: 

(a) providing a recombinant primitive cell comprising a polynucleic 
acid encoding a biologically active p/v// family polypeptide; 

5 (b) transfecting the recombinant primitive cell with a polynucleic acid 

encoding a predetermined biologically active polypeptide; 

(c) transferring the transfected primitive cells into an embryo to 
confer expression of the predetemnined biologically active 
polypeptide animal of the polypeptide, whereby the phenotype of 
1 0 the embryo is altered. 

81. The method of claim 80, wherein the animal is a bind, and the 
transfected primitive cell is transferred into an egg containing an embryonic 
bird. 

82. The method of claim 80, wherein the altered phenotype 
15 comprises a change in protein expression, a change in growth rate, feed 

efficiency, a change in stem cell count, a change in primordial germ cell count, 
disease resistance, or a combination thereof. 

83. The method of claim 80, wherein the embryonic animal is 
permitted to develop into a fetus or animal. 

20 84. A method of recovering a protein from a transgenic or chimeric 

non-human animal, the method comprising: 

(a) providing a transgenic or chimeric non-human animal having 
incorporated into its genome a polynucleic acid encoding a 
biologically active piwi family polypeptide, the polynucleic acid 

25 being present in said genome in a copy number effective to 

confer expression in the animal of the piwi family polypeptide; 
and 

(b) recovering the biologically active p/iv/ family polypeptide from the 
animal provided in step (a). 
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85. The method of claim 84, wherein the transgenic or chimeric non- 
human animal provided in step (a) further comprises another polynucleic acid 
incorporated into its genome, the polynucleic acid encoding a predetemnined 
biologically active polypeptide, the polynucleic acid being present in said 
5 genome in a copy number effective to confer expression in the animal of the 
predetermined polypeptide; and wherein the predetennined biologically active 
polypeptide is recovered in addition to or instead of the biologically active p'm 
family polypeptide. 

85. The method of claim 84 or 85, wherein the transgenic or chimeric 
10 non-human animal provided in step (a) further comprises an avian, and the 

polypeptide is recovered from an egg produced by the avian. 

86. A phannaceutical composition comprising an isolated and purified 
biologically active p/iy/ family polypeptide and a pharmaceutically acceptable 
carrier. 

15 87. The phannaceutical composition of claim 86, wherein the 

polypeptide comprises a polypeptide as essentially set fbriih in any of SEQ ID 
N0s:2, 4 and 6. 

88. The phannaceutical composition of claim 86, wherein the 
polypeptide is encoded by a polynucleic acid sequence as essentially set forth 

20 in any of SEQ ID N0s:1, 3 and 5. 

89. The pharmaceutical composition of claim 86, further comprising 
a cell expressing the polypeptide. 

90. A phannaceutical composition comprising an isolated and purified 
polynucleic acid encoding a biologically active p/iv/ family polypeptide and a 

25 phannaceutically acceptable canier. 

91. The phannaceutical composition of claim 90, wherein the 
encoded polypeptide comprises an amino acid sequence as essentially set 
forth in any of SEQ ID N0s:2, 4 and 6. 

92. The pharmaceutical composition of claim 90, further comprising 
30 a p/iv/ family-encoding polynucleic acid sequence as essentially set forth in any 

of SEQ ID N0s:1.3and 5. 
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93. The pharmaceutical composition of claim 90, wherein the 
polynucleic acid Is a DNA segment. 

94. The phannaceutical composition of claim 90, wherein the 
polynucleic acid is positioned under the control of a promoter 

5 95. The phannaceutical composition of claim 90, wherein the 

polynucleic acid further comprises a recombinant vector. 

96. The pharmaceutical composition of claim 90, further comprising 
a cell expressing the encoded polypeptide. 
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<210> 1 
<211>3047 
15 <212>DNA 

<213> Drosophila 
<220> 
<221> CDS 
<222> (84)..(2612) 



20 <220> 

<221> mlsc_feature 
<222> (120) 

<223> n=a or c, Xaa=Leu or He 
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<220> 

<221> mlsc_feature 
<222> (399) 

<223> n=a or t, Xaa=Leu or He 
<220> 

<221> misc_feature 
<222> (2436) 

<223> n=a or c, Xaa=Leu or lie 
<400>1 

ctgagtccaa agcgtcgttt tcaaagtact ctttcagttt ccattgtgaa gttttaagtg 60 

atcgcgagtg ccaaaaagta aca ats get gat gat cag gga cgt gga cgc agg 113 
Met Ala Asp Asp Gin Gly Arg Gly Arg Arg 
1 5 10 

cgt cca ntt aac gaa gat gat tec tct act tec cga ggt agt ggt gat 161 
Arg Pro Xaa Asn Glu Asp Asp Ser Sar Thr Ser Arg Gly Ser Gly Asp 
15 20 25 

ggg ccg egg gtg aaa gta ttc aga gga tct tea tea ggt gac ccg aga 209 
Gly Pro Arg Val Lys Val Phe Arg Gly Ser Ser Ser Gly Asp Pro Arg 
30 35 40 

gcg gat cct cgt ata gag get tea aga gag aga aga get etc gag gaa 257 
Ala Asp Pro Arg lie Glu Ala Ser Arg Glu Arg Arg Ala Leu Glu Glu 
45 50 55 

get ccc agg cgt gaa ggt ggc ccg cca gag cga aag ccg tgg ggt gac 305 
Ala Pro Arg Arg Glu Gly Gly Pro Pro Glu Arg Lys Pro Trp Gly Asp 
60 65 70 

caa tat gat tac ctg aat ace cgt ccg gtt gag ctg gta tec aag aag 353 
Gin Tyr Asp Tyr Leu Asn Thr Arg Pro Val Glu Leu Val ser Lys Lys 
75 80 85 90 

gga ace gat gge gtc eeg gtc atg ctg cag acg aac ttt ttt cga nta 401 
Gly Thr Asp Gly Val Pro Val Net Leu Gin Thr Asn Phe Phe Arg Xaa 
95 100 105 
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aaa acc aag ccg gaa tgg egg ate gtt cat tat cac gtg gag ttt gtg 449 
Lys Thr Lys Pro 6lu Trp Arg Ue Val His Tyr His Val Glu Phe Val 
110 115 120 

ecg acc ate gag aat cct cgt gtc cgt atg gga gtt ttg tec aat eat 497 
Pro Thr He Glu Asn Pro Arg Val Arg Met Gly Val Leu Ser Asn His 
125 130 135 

get aac ctt ctg gga tea ggc tat eta ttc gac gga ctg caa ctg ttc 545 
Ala Asn Leu Leu Gly Ser Gly Tyr Leu Phe Asp Gly Leu Gin Leu Phe 
140 145 150 

ace acc agg aaa ttc gag eag gaa ate aeg gtg etc age gga aag teg 593 
Thr Thr Arg Lys Phe Glu Gin Glu lie Thr Val Leu Ser Gly Lys Ser 
155 160 165 17D 

aag ctg gac att gaa tac aag ata tec ata aag ttc gtt gga ttc ata 641 
Lys Leu Asp lie Glu Tyr Lys lie Ser lie Lys Phe Val Gly Phe lie 
175 180 185 

teg tgt get gag ecc cgc ttt ttg caa gtc tta aat eta ata ttg cgc 689 
Ser Cys Ala Glu Pro Arg Phe Leu Gin Val Leu Asn Leu I le Leu Arg 
190 195 200 

cgc teg atg aag ggc eta aat ttg gaa tta gtt ggc cgt aat etc ttt 737 
Arg ser Met Lys Gly Leu Asn Leu Glu Leu Val Gly Arg Asn Leu Phe 
205 210 215 

gat ecc cga get aag ate gaa ata agg gag ttc aaa atg gag eta tgg 785 
Asp Pro Arg Ala Lys Ue Glu lie Arg Glu Phe Lys Net Glu Leu Trp 
220 225 230 

ccg ggc tat gag aca teg att cgt eag cac gaa aaa gat att tta ttg 833 
Pro Gly Tyr Glu Thr Ser He Arg Gin His Glu Lys Asp He Leu Leu 
235 240 245 250 

ggc acc gaa ata act cac aaa gtt atg cgc acc gag aeg ate tac gac 881 
Gly Thr Glu He Thr His Lys Val Het Arg Thr Glu Thr He Tyr Asp 
255 260 265 

ata atg cga cgt tgc tea cac aat ccg get cgt cat cag gac gaa gta 929 
He Het Arg Arg Cys Ser His Asn Pro Ala Arg His Gin Asp Glu Val 
270 275 280 

egg gta aat gtt ttg gac ttg att gtc ctt aeg gat tac aat aac aga 977 
Arg Val Asn Val Leu Asp Leu He Val Leu Thr Asp Tyr Asn Asn Arg 
285 290 295 

act tat cgt ate aat gat gtc gac ttt gga caa act ccg aaa tea aca 1025 
Thr Tyr Arg He Asn Asp Val Asp Phe Gly Gin Thr Pro Lys Ser Thr 
300 305 310 
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ttc agt tgc aag ggt aga gat ate agt ttc gtg gaa tac tat etc act 1073 
Phe Ser Cys Lys Gly Arg Asp Ue Ser Phe Val Glu Tyr Tyr Leu Thr 
315 320 325 330 

aaa tat aat ata cgc att cgc gac cac aat cag ccg ctg ctg att tec 1121 
Lys Tyr Asn Ue Arg Ue Arg Asp His Asn Gin Pro Leu Leu Ue Ser 
335 340 345 

aaa aat agg gac aag get eta aaa act aac get age gaa tta gtg gta 1169 
Lys Asn Arg Asp Lys Ala Leu Lys Thr Asn Ala Ser Glu Leu Val Val 
350 355 360 

eta att cct gag etc tgc cga gtg act ggg etc aat gee gag atg cgc 1217 
Leu Ue Pro Glu Leu Cys Arg Val Thr Gly Leu Asn Ala Glu Met Arg 
365 370 375 

tea aac ttt cag ctt atg egt gee atg age agt tat acg cga atg aac 1265 
Ser Asn Phe Gin Leu. Met Arg Ala Met Ser Ser Tyr Thr Arg Met Asn 
380 385 390 

ccc aaa caa cgc act gat cga ttg cgc get ttt aac cac egt tta caa 1313 
Pro Lys Gin Arg Thr Asp Arg Leu Arg Ala Phe Asn His Arg Leu Gin 
395 400 405 410 

aac act eca gaa agt gtg aag gtc ttg aga gac tgg aac atg gaa ctg 1361 
Asn Thr Pro Glu Ser Val Lys Val Leu Arg Asp Trp Asn Met Glu Leu 
415 420 425 

gac aag aac gtc aca gaa gta caa ggc egg ata att gga cag cag aac 1409 
Asp Lys Asn Val Thr Glu Val Gin Gly Arg Ue Ue Gly Gin Gin Asn 
430 435 440 

ate gtg ttt cat aat gga aag gtt cct get gga gaa aac get gat tgg 1457 
Ue Val Phe His Asn Gly Lys Val Pro Ata Gly Glu Asn Ala Asp Trp 
445 450 455 

caa agg cac ttc aga gac caa agg atg ctt aec act ccg age gat ggc 1505 
Gin Arg His Phe Arg Asp Gin Arg Met Leu Thr Thr Pro Ser Asp Gly 
460 465 470 

etc gat egt tgg get gtc ate geg ceg eaa agg aat tec eat gaa etc 1553 
Leu Asp Arg Trp Ala Val Ue Ala Pro Gin Arg Asn Ser His Glu Leu 
475 480 485 490 

ega act eta ctt gac tct ttg tat aga gca get agt gga atg ggt ctt 1601 
Arg Thr Leu Leu Asp ser Leu Tyr Arg Ala Ala Ser Gly Met Gly Leu 
495 500 505 

aga att cga age ccc cag gaa ttc ata att tat gat gat cgc act gga 1649 
Arg Ue Arg Ser Pro Gin Glu Phe Ue Ue Tyr Asp Asp Arg Thr Gly 
510 515 520 
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act tat gtg aga gca atg gat gat tgt gtg cgc tea gat ccc aaa ctt 
Thr Tyr Val Arg Ala Het Asp Asp Cys Val Arg Sep Asp Pro Lys Leu 
525 530 535 



1697 



ata tta tgc etc gta ccc aat gat aac gcc gaa aga tac tea tea ate 
lie Leu Cys Leu Val Pro Asn Asp Asn Ala Giu Arg Tyr Ser Ser He 
540 545 550 



1745 



10 



aaa aag aga gga tac gtt gac agg gcg gtg cca act caa gtt gtg ace 1793 
Lys Lys Arg Gly Tyr Val Asp Arg Ala Val Pro Thr Gin Val Val Thr 
555 560 565 570 

ctt aaa acg acc aag aac cgt age ctt atg age att gcc acc aaa ata 1841 
Leu Lys Thr Thr Lys Asn Arg Ser Leu Met Ser He Ala Thr Lys He 
575 580 585 



15 



gca ate caa ctg aat tgc aag ttg gga tat aca ccc tgg atg ate gaa 
Ala He Gin Leu Asn Cys Lys Leu Gly Tyr Thr Pro Trp Het He Glu 
590 595 600 



1889 



eta ccc ttg tec gga ctg atg aca att ggc ttt gac att gcg aag age 
Leu Pro Leu Ser Gly Leu Met Thr He Gly Phe Asp He Ala Lys Ser 
605 610 615 



1937 



20 



aca ega gat egg aag agg gee tac gga gca ttg att gcc tea atg gat 
Thr Arg Asp Arg Lys Arg Ala Tyr Gly Ala Leu He Ala Ser Net Asp 
620 625 630 



1985 



25 



eta cag caa aac tec acg tac ttc age aca gtc acg gag tgc age gcc 
Leu Gin Gin Asn Ser Thr Tyr Phe Ser Thr Val Thr GLu Cys Ser Ala 
635 640 645 650 

ttt gat gtg etc get aac acc ctt tgg ccg atg ata gca aag gcc ctg 
Phe Asp Val Leu Ala Asn Thr Leu Trp Pro Het He Ala Lys Ala Leu 
655 660 665 



2081 



30 



cgc caa tat caa cat gag cat agg aag ctg cca tct cga ate gta ttt 
Arg Gin Tyr Gin His Glu His Arg Lys Leu Pro Ser Arg He Val Phe 
670 675 680 



2129 



tat cga gac ggt gtg age tec ggc tct eta aag cag ctt ttt gaa ttt 

Tyr Arg Asp Gly Val Ser Ser Gly Ser Leu Lys Gin Leu Phe Glu Phe 
685 690 695 



2177 



gaa gtc aag gac ate att gag aag ttg aaa act gaa tac gcc cgc gtc 
35 Glu Val Lys Asp He He Glu Lys Leu Lys Thr Glu Tyr Ala Arg Val 

700 705 710 



2225 



cag eta age cca ccg caa tta get tat att gtg gta acc aga tec atg 
Gin Leu Ser Pro Pro Gin Leu Ala Tyr He Val Val Thr Arg Ser Het 
715 720 725 730 



2273 
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aac acg cgc ttc ttc etc aac goa caa aat cct ccg cct ggt act ata 2321 
Asn Thr Arg Phe Phe Leu Asn Gly Gin Asn Pro Pro Pro Gly Thr I le 
735 740 745 

gtt gat gac gtt ata act ctg ccc gag aga tac gac ttt tat ctg gtc 2369 
Vat Asp Asp Val lie Thr Leu Pro Glu Arg Tyr Asp Phe Tyr Leu Val 
750 755 760 

teg caa caa gtt cgt cag ggt aca gtg teg ccg acc age tac aat gtt 2417 
Ser Gin Gin Val Arg Gin Gly Thr Val Ser Pro Thr Ser Tyr Asn Val 
765 770 775 

ctt tat age age atg ggt ntc tea ccg gag aaa atg caa aaa ctt acg 2465 
Leu Tyr Ser Ser Het Gly Xaa Ser Pro Glu Lys Net Gin Lys Leu Thr 
780 785 790 

tac aag atg tgc cac ttg tac tac aat tgg teg ggc acc aca cga gtg 2513 
Tyr Lys Het Cya His Leu Tyr Tyr Asn Trp Ser Gly Thr Thr Arg Val 
795 800 805 810 

cca gca gtt tgc cag tac get aag aag eta get acc etc gtg ggt acg 2561 
Pro Ala Val Cys Gin Tyr Ala Lys Lys Leu Ala Thr Leu Val Gly Thr 
815 820 825 

aac ttg cac tet att ccg caa aac gcg etc gaa aag aag ttt tat tat 2609 
Asn Leu His Ser lie Pro Gin Asn Ala Leu Glu Lys Lys Phe Tyr Tyr 
830 835 840 

eta taattggata taatttagaa tggagtatta atecttacta agaggccata 2662 
Leu 

tatgaaacta gcccagacat ttatactttt tcaatacttc cttacttttg ctaagcactt 2722 
cagcatttat gactaaatat tttgtatttg aaatgeatta ctgetetttt ttcaaacaaa 2782 
agcaaaattg aggattaaga ttctggtatt taagcataag accagaggaa attcccaaac 2842 
aaacatttaa agttatctat caagacatgt tcattaattt ggaatataat tactttattt 2902 
tttattgtat attttagttt atgtaaagaa aaattacata catccatgtt tgcttactta 2962 
accacacatt catggctgct tatattcgtg aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3022 
aaaaaaaaaa aaaaaaaaaa aaaaa 3047 
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<212> PRT 

<213> Dmsophila sp. 

<400> 2 

Net Ala Asp Asp Gin Gly Arg Gly Arg Arg Arg Pro Xaa Asn Glu Asp 
t 5 10 15 

Asp Scr Ser Thr Ser Arg Gly Ser Gly Asp Gly Pro Arg Val Lys Val 
20 25 30 

Phe Arg Gly Ser Ser Ser Gly Asp Pro Arg Ala Asp Pro Arg lie Glu 
35 40 45 

Ala Ser Arg Glu Arg Arg Ala Leu Glu Glu Ala Pro Arg Arg Glu Gly 
50 55 60 

Gly Pro Pro Glu Arg lys Pro Trp Gly Asp Gin Tyr Asp Tyr Leu Asn 
^ 70 75 80 

Thr Arg Pro Val Glu Leu Val Ser Lys Lys Gly Thr Asp Gly Val Pro 
85 90 95 

Val Met Leu Gin Thr Asn Phe Phe Arg Xaa Lys Thr Lys Pro Glu Trp 
100 105 110 

Arg He Val His Tyr His Val Glu Phe Val Pro Thr lie Glu Asn Pro 
115 120 125 

Arg Val Arg Net Gly Val Leu Ser Asn His Ala Asn Leu Leu Gly Ser 
130 135 140 

Gly Tyr Leu Phe Asp Gly Leu Gin Leu Phe Thr Thr Arg Lys Phe Glu 
145 150 155 160 

Gin Glu lie Thr Val Leu Ser Gly Lys Ser Lys Leu Asp lie Glu Tyr 
165 170 175 

Lys lie Ser lie Lys Phe Val Gly Phe lie Ser Cys Ala Glu Pro Arg 
180 185 190 

Phe Leu Gin VaL Leu Asn Leu He Leu Arg Arg Ser Net Lys Gly Leu 
195 200 205 

Asn Leu Glu Leu Val Gly Arg Asn Leu Phe Asp Pro Arg Ala Lys lie 
210 215 220 



Glu lie Arg Glu Phe Lys Met Glu Leu Trp Pro Gly Tyr Glu Thr Ser 
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225 



230 



235 



240 



Ue Arg Gin His Glu Lys Asp Ue leu Leu Gly Thr Glu lie Thr His 
245 250 255 

Lys Val Net Arg Thr Glu Thr He Tyr Asp lie Net Arg Apq Cys Ser 

260 265 270 

His Asn Pro Ala Arg His Gin Asp Glu Val Arg Val Asn Val Leu Asp 
275 280 285 

Leu Ue Val Leu Thr Asp Tyr Asn Asn Arg Thr Tyr Arg lie Asn Asp 
290 295 300 

Val Asp Phe Gly Gin Thr Pro Lys Ser Thr Phe Ser Cys Lys Gly Arg 
305 310 315 320 

Asp lie Ser Phe Val Glu Tyr Tyr Leu Thr Lys Tyr Asn lie Arg He 
325 330 335 

Arg Asp His Asn Gin Pro Leu Leu He Ser Lys Asn Arg Asp Lys Ala 
340 345 350 

Leu Lys Thr Asn Ala Ser Glu Leu Val Val Leu He Pro Glu Leu Cys 
355 360 365 

Arg Val Thr Gly Leu Asn Ala Glu Het Arg Ser Asn Phe Gin Leu Net 
370 375 380 

Arg Ala Net Ser Ser Tyr Thr Arg Net Asn Pro Lys Gin Arg Thr Asp 
385 . 390 395 400 

Arg Leu Arg Ala Phe Asn His Arg Leu Gin Asn Thr Pro Glu Ser Val 
405 410 415 

Lys val Leu Arg Asp Trp Asn Met Glu Leu Asp Lys Asn Val Thr Glu 
420 425 430 

Val Gin Gly Arg He He Gly Gin Gin Asn He Val Phe His Asn Gly 
435 aO 445 

Lys Val Pro Ala Gly Glu Asn Ala Asp Trp Gin Arg His Phe Arg Asp 
450 455 460 

Gin Arg Met Leu Thr Thr Pro Ser Asp Gly Leu Asp Arg Trp Ala Val 
465 470 475 480 



He Ala Pro Gin Arg Asn ser His Glu Leu Arg Thr Leu Leu Asp Ser 
485 490 495 



Leu Tyr Arg Ala Ala Ser Gly Met Gly Leu Arg He Arg Ser Pro Gin 
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505 510 



Glu Phe lie He Tyr Asp Asp Arg Thr Gly Thr Typ Val Arg Ala Met 
515 520 . 525 

Asp Asp Cys Val Arg Sen Asp Pro Lys Leu I le Leu Cys Leu Val Pro 
530 535 540 

Asn Asp Asn Ala Glu Arg Tyr Ser Ser lie Lys Lys Arg Gly Tyr Val 
545 550 555 560 

Asp Arg Ala Val Pro Thr Gin Val Val Thr Leu Lys Thr Thr Lys Asn 
565 570 575 

Arg Ser leu Met Ser. lie Ala Thr Lys lie Ala lie Gin Leu Asn Cys 
580 585 590 

Lys Leu Gly Tyr Thr Pro Trp Net He Glu Leu Pro Leu Ser Gly Leu 
595 600 605 

Net Thr He Gly Phe Asp lie Ala Lys Ser Thr Arg Asp Arg Lys Arg 
610 615 620 

Ala Tyr Gly Ala Leu He Ala Ser Net Asp Leu Gin Gin Asn Ser Thr 
625 630 635 640 

Tyr Phe Ser Thr Val Thr Glu Cys Ser Ala Phe Asp Val Leu Ala Asn 
645 650 655 

Thr Leu Trp Pro Net He Ala Lys Ala Leu Arg Gin Tyr Gin His Glu 
660 665 670 

His Arg Lys Leu Pro Ser Arg He Val Phe Tyr Arg Asp Gly Val Ser 
675 680 685 

Ser Gly Ser Leu Lys Gin Leu Phe Glu Phe Glu Val Lys Asp He He 
690 695 700 

Glu Lys Leu L^ Thr Glu Tyr Ala Arg Val Gin Leu Ser Pro Pro Gin 
705 710 715 720 

Leu Ala Tyr He Val Val Thr Arg Ser Met Asn Thr Arg Phe Phe Leu 
725 730 755 

Asn Gly Gin Asn Pro Pro Pro Gly Thr Ue Val Asp Asp Val He Thr 
740 745 750 



Leu Pro Glu Arg Tyr Asp Phe Tyr Leu Val Ser Gin Gin Val Arg Gin 
755 760 765 



Gly Thr Val Ser Pro Thr Ser Tyr Asn Val Leu Tyr Ser Ser Met Gly 
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370 775 780 

Xaa Ser Pro Glu Lys Het 6ln Lys Leu Thr Tyr Lys Met Cys His Leu 
785 790 795 800 

Tyr Tyr Asn Trp Ser 6ly Thr Thr Arg V«l Pro Ala Val Cys Gin Tyr 
5 805 810 815 

Ala Lys Lys Leu Ala Thr Leu Val Gly Thr Asn Leu His Ser He Pro 
820 825 850 

Gin Asn Ala Leu Glu Lys Lys Phe Tyr Tyr Leu 
855 840 

10 <210> 3 

<211>4064 
<212> DNA 
<213> Mussp. 
<220> 
15 <221> CDS 

<222> (191)..(2776) 

<220> 

<221> miscjeature 
<222> (450) 
20 <223> n^a or c, Xaa^Leu or lie 

<220> 

<221> miscjieature 
<222> (1337) 

<223> n=a or c, Xaa=Leu or lie 
25 <220> 
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<221> miscjeature 
<222> (2636) 

<223> n=a or c, Xaa=Leu or lie 
<400>3 

agaagcctgc tacagggggt ggcgtgagga gcctcaagga ctgcggacct aagcgtgcta 60 

agcacggggc caggatagga ctggacgctg ctggaccacg acgatcaggg agtgaccccc 120 

accagcatcc agcctgcggc agttgtagga ctggaaggag gccaagctga acacctgagc 180 

tgactggaaa atg act ggc cga gcc cga get egg gcc cgc ggc agg gca 2Z9 
Met Thr Gly Arg Ala Arg Ala Arg Ala Arg Gly Arg Ala 
1 5 10 

cga ggt cag gag acg gtg cag cat gtt ggg get get gcg age cag caa 277 
Arg Gly Gin GLu Thr Val Gin Hia Val Gly Ala Ala Ala Ser Gin Gin 
15 20 25 

cct ggg tac ate cca ccg aga cct caa cag tec ccc aca gag ggg gac 325 
Pro Gly Tyr lie Pro Pro Arg Pro Gin Gin Ser Pro Thr Glu Gly Asp 
30 35 40 45 

ttg gtt ggc cga gga cga cag agg ggg atg gta gtc gga gcc aca tec 373 
teu val Gly Arg Gly Arg Gin Arg Gly Met Val Val Gly Ala Thr Ser 
50 55 60 

aag tea caa gaa ctg cag ate tea get ggg ttt cag gag ctg tea ctg 421 
Lya Ser Gin Glu leu Gin lie Ser Ala Gly Phe Gin Glu Leu Ser Leu 
65 70 75 

gca gag aga gga ggg cgt cgc cga gac ttc cat gac ntt ggt gtg aac 469 
Ala Glu Arg Gly Gly Arg Arg Arg Asp Phe His Asp Xaa Gly Val Asn 
80 85 90 

acc aga cag aac ctt gac eat gtc aaa gag tea aag aca ggc tec tct 517 
Thr Arg Gin Asn Leu Asp His Val Lys Glu Ser Lys Thr Gly Ser Ser 
95 100 105 

ggc ate att gtg aag ctg age acc aac cac ttc egg ctg acc teg cgc 565 
Gly lie He Val Lys Leu Ser Thr Asn His Phe Arg Leu Thr Ser Arg 
110 115 120 125 

cca cag tgg gcc ctg tat cag tac cac ate gac tac aat ccc ctg atg 613 
Pro Gin Trp Ala Leu Tyr Gin Tyr His He Asp Tyr Asn Pro Leu Met 
130 135 140 
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gaa gcc cga agg ctt cgc tec gca ctg etc ttc cag cat gaa gac etc 
Glu Ala Arg Arg Leu Arg Ser Ala Leu Leu Phe Gin His 61u Asp Leu 
145 150 155 



661 



att gga agg tgt cat get ttc gat ggg aca ata ttg ttt tta cct aag 709 
lie Gly Arg Cys Hfs Ala Phe Asp Gly Thr lie Leu Phe Leu Pro Lys 
160 165 170 



aga eta cag cac aag gtc aca gaa gta ttc agt cag act egg aat ggg 757 
Arg Leu Gin His Lys Val Thr Glu Val Phe Ser Gin Thr Arg Asn Gly 
175 180 185 

gaa cac gtg agg ate acc ate ace ctg acc aac gag ctg ccg ccc ace 805 
Glu His Val Arg He Thr He Thr Leu Thr Asn Glu Leu Pro Pro Thr 
1W 195 200 205 



15 



teg ccc acc tgc ctg cag ttc tat aac ate ntc ttc agg agg etc ttg 
Ser Pro Thr Cys Leu Gin Phe Tyr Asn I le Xaa Phe Arg Arg Leu Leu 
210 215 220 



853 



aaa ate atg aat ttg caa caa att gga egg aat tat tac aat cca agt 901 
Lys He Met Asn Leu Gin Gin He Gly Arg Asn Tyr Tyr Asn Pro Ser 
225 230 235 



20 



gac ccg att gat att cca aac cac agg ttg gtg ate tgg ccc gge ttc 
Asp Pro He Asp He Pro Asn Hfs Arg Leu Val He Trp Pro Gly Phe 
240 245 250 



949 



acc acc tec ate ctt cag tat gag aac aac ate atg etc tgc aca gac 997 
Thr Thr Ser He Leu Gin Tyr Glu Asn Asn He Met Leu Cys Thr Asp 
255 260 265 

gtc age cac aag gtg etc cgc age gag act gtc eta gac ttc atg ttc 1045 
Val Ser His Lys Val Leu Arg Ser Glu Thr Val Leu Asp Phe Met Phe 
270 275 280 285 



30 



aat eta tac cag cag aca gag gag cac aag ttc cag gag caa gtg teg 
Asn Leu Tyr Gin Gin Thr Glu Glu His Lys Phe Gin Glu Gin Val Ser 
290 295 300 



1093 



aag gag etc ata ggc etc ate gtt etc acc aag tac aat aac aag acc 1141 
Lys Glu Leu He Gly Leu He Val Leu Thr Lys Tyr Asn Asn Lys Thr 
305 310 315 



tac egg gtg gat gac att gac tgg gac cag aat cca aag age ace ttc 
35 Tyr Arg Val Asp Asp He Asp Trp Asp Gin Asn Pro Lys Ser Thr Phe 

320 325 330 



1189 



aag aag gcg gat ggc teg gag gtc age ttc ctg gag tac tac agg aag 
Lys Lys Ala Asp Gly Ser Glu Val Ser Phe Leu Glu Tyr Tyr Arg Lys 
335 340 345 



1237 



10 
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caa tac aac cag gag ate acg gac ctg aag cag ccg gtg ctg gtg age 1285 
Gin Tyr Aan Gin Glu lie Thr Asp Leu Lys Gin Pro Val Leu Val Ser 
350 355 360 365 

caa ccc aag egg agg aga ggc ccc ggc ggc ace etg cot ggc cca get 1333 
Gin Pro Lys Arg Arg Arg Gly Pro Gly Gly Thr Leu Pro Gly Pro Ala 
370 375 380 

atg ntc ate cct gaa etc tgc tat cte aca ggc ctg aet gat aaa atg 1381 
Met Xaa lie Pro Glu Leu Cys Tyr Leu Thr Gly Leu Thr Asp Lys Net 
385 390 395 

egc aat gat ttc aat gtg atg aag gac ctg gca gtg cac acg egg ctg 1429 
Arg Asn Asp Phe Asn Val Met Lys Asp Leu Ala Val His Thr Arg Leu 
400 405 410 



acc cct gag cag egg cag egg gag gtg gge cgc etc ate gac tac ate 1477 
Thr Pro Glu Gin Arg Gin Arg Glu Val Gly Arg Leu He Asp Tyr He 
15 415 420 425 

cae aag gat gac aat gtg cag aga gag ctt cga gac tgg ggc ctg age 1525 
His Lys Asp Asp Asn Val Gin Arg Glu Leu Arg Asp Trp Gly Leu Ser 
430 435 440 445 

ttc gac tea aac ttg ctg tec ttc tct gga aga ate tta caa tct gag 1573 
20 Phe Asp Ser Asn Leu Leu Ser Phe Ser Gly Arg He Leu Gin Ser Glu 
450 455 460 

aag ate cae cag ggc gga aag acg ttt gat tac aac cca caa ttt gca 1621 
Lys He His Gin Gly Gly Lys Thr Phe Asp Tyr Asn Pro Gin Phe Ala 
465 470 475 

25 gac tgg tec aaa gaa aca aga ggc gcg ccg ctg ate age gtg aag cca 1669 

Asp Trp Ser Lys Glu Thr Arg Gly Ala Pro Leu He Ser Val Lys Pro 
480 485 490 

ttg gat aac tgg ctg etg ate tat acc cgc agg aat tat gaa gca gee 1717 
Leu Asp Asn Trp Leu Leu He Tyr Thr Arg Arg Asn Tyr Glu Ala Ala 
30 495 500 505 

aac tea etg ata cag aac ctg ttc aaa gtg act cca gcc atg ggc ate 1765 
Asn Ser Leu He Gin Asn Leu Phe Lys Val Thr Pro Ala Met Gly He 
510 515 520 525 

cag atg aaa aag gca ate atg ate gag gtg gat gac aga aca gaa get 1813 
35 Gin Met Lys Lys Ala He Met He Glu Val Asp Asp Arg Thr Glu Ala 
530 535 540 

tat ctg aga gee ttg cag cag aag gtg acg tea gac aet cag ata gtt 1861 
Tyr Leu Arg Ala Leu Gin Gin Lys Val Thr Ser Asp Thr Gin He Val 
545 550 555 
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gtc tgt etc ttg tea agt aat egg aag gac aaa tat gat gee ate aag 1909 
Vat Cya Leu Leu $er Ser Asn Arg Lys Asp Lys Tyr Asp Ala He Lys 
560 565 570 

aag tac ttg tgt aca gac tgc ccc acc cca agt cag tgt gtg gtg gee 1957 
Lys Tyr Leu Cys Thr Asp Cys Pro Thr Pro Ser Gin Cys Val Val Ala 
575 580 585 

egg acc ctg ggc aag cag caa aca gtc atg gcc att gcc acc aag ate 2005 
Arg Thr Leu Gly Lys Gin Gin Thr Val Het Ala Ue Ala Thr Lys lie 
590 595 600 605 

gcc ctg cag atg aac tgc aag atg gga ggc gag etc tgg egg gtg gac 2053 
Ala Leu Gin Met Asn Cys Lys Met Gly Gly Glu Leu Trp Arg Val Asp 
610 615 620 

atg gcc ctg aaa ctg gca atg ate gtg ggc ate gac tgt tac cat gac 2101 
Met Ala Leu Lys Leu Ala Het lie Val Gly lie Asp Cys Tyr Hfs Asp 
625 630 635 

acc aca get ggg egg agg tec ate gca gga ttc gtc gcc age ate aat 2149 
Thr Thr Ala Gly Arg Arg Ser lie Ala Gly Phe Val Ala Ser lie Asn 
640 645 650 

gaa ggg atg acc egc tgg ttc tec cge tgc gtc ttt cag gac cgc ggg 2197 
Glu Gly Met Thr Arg Trp Phe Ser Arg Cys Val Phe Gin Asp Arg Gly 
655 660 665 

cag gag ctg gtg gat ggt etc aag gtg tgc ttg caa get get ctg agg 2245 
Gin Glu Leu Val Asp Gly Leu Lys Val Cys Leu Gin Ala Ala Leu Arg 
670 675 680 685 

get tgg agt ggc tgc aat gaa tac atg ccc age cgt gtc ate gtg tac 2293 
Ala Trp Ser Gly Cys Asn Glu Tyr Het Pro Ser Arg Val lie Val Tyr 
690 695 700 

ega gac ggt gtg ggg gac ggg cag ctg aag acc ctg gtc aat tat gag 2341 
Arg Asp Gly Val Gly Asp Gly Gin Leu Lys Thr Leu Val Asn Tyr Glu 
705 710 715 

gtc cca cag ttc eta gat tgc etc aag tea gtc ggg aga ggt tac aac 2389 
Val Pro Gin Phe Leu Asp Cys Leu Lys Ser Val Gly Arg Gly Tyr Asn 
720 725 730 

cca aga ctg act gta ate gtg gtg aag aag cgt gtc aat gcc agg ttt 2437 
Pro Arg Leu Thr Vat He Val Val Lys Lys Arg Val Asn Ala Arg Phe 
735 740 745 

ttt get cag tct ggg gga aga ett cag aac cet ett cca ggg aca gtc 2485 
Phe Ala Gin Ser Gly Gly Arg Leu Gtn Asn Pro Leu Pro Gly Thr Val 
750 755 760 765 
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ate gat gtg gaa gtc acc aga cca gag tgg tat gac ttt ttc ate gtg 2533 
lie Asp Val Glu Val Thr Arg Pro Glu Trp Tyr Asp Phe Phe He Val 
770 775 780 

age cag gca gtg aga age ggg agt gtg tee cca aca cac tae aat gtc 2581 
Ser Gin Ala Val Arg Ser Giy Ser Val Ser Pro Thr His Tyr Asn Val 
785 790 795 

ate tat gac age agt ggc ctg aag ccc gac cac ate cag egg ctg aca 2629 
lie Tyr Asp Ser Ser Gly Leu Lys Pro Asp His lie Gin Arg Leu Thr 
800 805 810 

tae aag ntc tgc cac gtg tac tat aat tgg cct gga gtc ate ega gtc 2677 
Tyr Lys Xaa Cys His Val Tyr Tyr Asn Trp Pro Gly Val He Arg Val 
815 820 825 

cct gca cct tgc cag tat gca cac aag ctg gee ttc etc gtg ggc cag 2725 
Pro Ala Pro Cys Gin Tyr Ala His Lys Leu Ala Phe Leu Val Gly Gin 
830 835 840 845 

age ate cac aga gag cca aac etc tec ctg tec aac cgc etc tac tac 2773 
Ser He His Arg Glu Pro Asn Leu Ser Leu Ser Asn Arg Leu Tyr Tyr 
850 1^5 860 

etc taaceageag cgagggccgg catgctggtt tecttcagag ctctgcccaa 2826 
Leu 

tecttcagtt ctatttttca taggatgagg ttctagcggg atgttggggg taggggttgg 2886 
gggccgctgt tttgttttgt tttgttttta gttgttggtg tgtagcggct atagagttaa 2946 
ctaaggctta tgtttgcttt gtttctcatg gtgtgttgtg acagtttgct tggcagagtg 3006 
gtggcaaggg eegtggtace acagatagac tttcctaaag cagccgtgtt ggcaggttct 3066 
tggaggtgct cctgggagca cctcctattg tacgctgctc tctctctctc tctctctctc 3126 
tctctctctc tctctctctc tctctctctc tgcatgtttg aatttaaaag aggaaaggac 3186 
acaaggcaag aatgcecaga geatcctaga tgggtgctat atgggactct gacacctcag 3246 
gttcttaaag ataaaaaggc cagaggtace tttctgtaat tettttaaga gttaggtcat 3306 
aaggaatgaa aagagttact tttctccctc catgaaagtg ttttttaaga ctgacaaggc 3366 
cggtggtgga gtgagtgttg ttgttttgtt tcgcaatttt cttaccaagt gaaattactg 3426 
accataattt tatttaatag tttttagttt ttgaaagcct gtgttttctt tgcttttaaa 3486 
ctacaggcga ccactctggg ataggattcc actcccaaaa gtaatgtgtt gggtgtggtg 3546 



WO0(M32039 PCT/US99/28764 

-16- 

gcacacgcct stgatctcag gactcaggag gtgaagccag gggaacccca agttcccggt 3606 
cagcctaggc tacatagtga gactttgtct taacaaacag aaaaaaagaa aaagaaagaa 3666 
aaaaaaaaga aagaaaaaaa aaraaagaaa aatggagcag tatgagttat gggtgccagt 3726 
ttctgcagga gactttggga gtaagagctg cctctgttcc taagaggctg caaggctgcg 37S6 
5 gctggggggc cagttcaggt gccaggtgag gtacggccac atctatttgc ctggctcttt 3846 
gacaaagaga gcagcttaga gtggctagag tgtgatgagt tttacacaat ggcgatgtgt 3906 
gtgtgtgccc ctggcacagc tcaataaagt ttctgtctca ctttaaaaaa aaaaaaaaaa 3966 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 4026 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa 4064 

10 <210>4 

<211>862 
<212> PRT 
<213> Musap. 



<400>4 

15 Met Thr Giy Arg Ala Arg Ala Arg Ala Arg Gly Arg Ala Arg 6ly Gin 
15 10 15 

Glu Thr Val Gin His Val Gly Ala Ala Ala Ser Gin Gin Pro Gly Tyr 
20 25 30 

lie Pro Pro Arg Pro Gin Gin Ser Pro Thr Glu Gly A^p Leu Val Gly 
20 35 40 45 

Arg Gly Arg Gin Arg Gly Net Val Val fily Ala Thr Ser tys Ser Gin 
50 55 60 

Glu Leu Gin lie Ser Ala Gly Phe Gin Glu Leu Ser Leu Ala Glu Arg 
65 70 75 80 

25 Gly Gly Arg Arg Arg Asp Phe His Asp Xaa Gly Val Asn Thr Arg Gin 
85 90 95 



Asn Leu Asp His Val Lys Glu Ser tys Thr Gly Ser Ser Gly Ue lie 
100 105 110 
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Val Lys Leu Ser Thr Asn His Phe Arg Leu Thr Ser Arg Pro Gin Trp 
115 120 125 

Ala Leu Tyr Gin Tyr His He Asp Tyr Asn Pro Leu Met Glu Ala Arg 
130 135 140 

Arg Leu Arg Ser Ala Leu Leu Phe Gin His 6Lu Asp Leu He Gly Arg 
1*5 150 155 160 

Cys His Ala Phe Asp Gly Thr lie Leu Phe Leu Pro Lys Arg Leu Gin 
165 170 175 

His Lys Val Thr Glu Val Phe Ser Gin Thr Arg Asn Gly Glu His Val 
180 185 190 

Arg He Thr He Thr Leu Thr Asn Glu Leu Pro Pro Thr Ser Pro Thr 
195 200 205 

Cya Leu Gin Phe Tyr Asn He Xaa Phe Arg Arg Leu Leu Lys He Met 
210 215 220 

Asn Leu Gin Gin He Gly Arg Asn Tyr Tyr Asn Pro Ser Asp Pro He 
225 230 255 240 

Asp He Pro Asn His Arg Leu Val He Trp Pro Gly Phe Thr Thr Ser 
245 250 255 

He Leu Gin Tyr Glu Asn Asn He Met Leu cys Thr Asp Val ser His 
260 265 270 

Lys Val Leu Arg Ser Glu Thr Val Leu Asp Phe Met Phe Asn Leu Tyr 
275 280 28S 

Gin Gin Thr Glu Glu His lys Phe Gin Glu Gin Val Ser Lys Glu Leu 
290 295 300 

He Gly Leu He Val Leu Thr Lys Tyr Asn Asn Lys Thr Tyr Arg Val 
3W 310 315 320 

Asp Asp He Asp Trp Asp Gin Asn Pro Lys Ser Thr Phe Lys Lys Ala 
325 330 335 

Asp Gly Ser Glu Val Ser Phe Leu Glu Tyr Tyr Arg Lys Gin Tyr Asn 
3A0 345 350 



Gin Glu He Thr Asp Leu Lys Gin Pro Val Leu Val Ser Gin Pro Lys 
355 360 365 



Arg Arg Arg Gly Pro Gly Gly Thr Leu Pro Gly Pro Ala Met Xaa He 
370 375 380 
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Pro Glu Leu Cys Tyr Leu Thr eiy Leu Thr Asp Lys Met Arg Asn Asp 
390 395 400 

Phe Asn Val Met Lys Asp Leu Ala Val His Thr Arg Leu Thr Pro Glu 
405 410 415 

Gin Arg Gin Arg Glu Val Gly Arg Leu He Asp Tyr lie His Lys Asp 
A20 425 430 

Asp Asn Val Gin Arg Glu Leu Arg Asp Trp Gly Leu Ser Phe Asp Ser 
435 440 445 

Asn Leu Leu Ser Phe Ser Gly Arg He Leu Gin Ser Glu Lys He His 
450 455 460 

Gin Gly Gly Lys Thr Phe Asp Tyr Asn Pro Gin Phe Ala Asp Trp Ser 
470 475 4ao 

Lys Glu Thr Arg Gly Ala Pro Leu He Ser Val Lys Pro Leu Asp Asn 
485 490 495 

Trp Leu Leu He Tyr Thr Arg Arg Asn Tyr Glu Ala Ala Asn Ser Leu 
500 505 510 

He Gin Asn Leu Phe Lys Val Thr Pro Ala Met Gly He Gin Met Lys 
515 520 525 

Lys Ala He Met He Glu Val Asp Asp Arg Thr Glu Ala Tyr Leu Arg 
530 535 540 

Ala Leu Gin Gin Lys Val Thr Ser Asp Thr Gin He Val Val Cys Leu 
545 550 555 560 

Leu Ser Ser Asn Arg Lys Asp Lys Tyr Asp Ala He Lys Lys Tyr Leu 
565 570 575 

Cys Thr Asp Cys Pro Thr Pro Ser Gin Cys Val Val Ala Arg Thr Leu 
580 585 590 

Gly Lys Gin Gin Thr Val Met Ala He Ala Thr Lys He Ala Leu Gin 
595 600 605 

Met Asn Cys Lys Met Gly Gly Glu Leu Trp Arg Val Asp Met Ala Leu 
610 615 620 



Lys Leu Ala Met He Val Gly He Asp Cys Tyr His Asp Thr Thr Ala 
625 630 635 640 



Gly Arg Arg Ser He Ala Gly Phe Val Ala Ser He Asn Glu Gly Met 
645 650 655 
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Thr Arg Trp Phe Ser Arg Cys Val Phe Gin Asp Arg Gly Gin Glu Leu 
660 665 670 

Val Asp Gly Leu Lys Val Cys Leu Gin Ala Ala Leu Arg Ala Trp Ser 
675 680 685 

Gly Cys Asn Glu Tyr Met Pro Ser Arg Val lie Val Tyr Arg Asp Gly 
690 695 700 

Val Gly Asp Gly Gin Leu Lys Thr Leu Val Asn Tyr Glu Val Pro Gin 
705 710 715 720 

Phe Leu Asp Cys Leu Lys Ser Val Gly Arg Gly Tyr Asn Pro Arg Leu 
725 730 735 

Thr Val lie Val Val Lys Lys Arg Val Asn Ala Arg Phe Phe Ala Gin 
740 745 750 

Ser Gly Gly Arg Leu Gin Asn Pro Leu Pro Gly Thr Val He Asp Val 
755 760 765 

Glu Val Thr Arg Pro Glu Trp Tyr Asp Phe Phe He Val Ser Gin Ala 
770 775 780 

Val Arg Ser Gly Ser Val Ser Pro Thr His Tyr Asn Val He Tyr Asp 
785 790 795 800 

Sen ser Gly Leu Lys Pro Asp His He Gin Arg Leu Thr Tyr Lys Xaa 
805 810 815 

Cys His Val Tyr Tyr Asn Trp Pro Gly Val lie Arg Val Pro Ala Pro 
820 825 830 

cys Gin Tyr Ala His Lys Leu Ala Phe Leu Val Gly Gin Ser He Hfs 
835 840 845 

Arg Glu Pro Asn Leu Ser Leu Ser Asn Arg Leu Tyr Tyr Leu 
850 855 860 

<210>5 
<211>3472 
<212> DNA 
<213> Homo sapiens 
<220> 
<221>CDS 
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<222> (167)..(2749) 
<220> 

<221> misc_feature 
<222> (392) 
5 <223> n=t or a» Xaa=Leu or He 

<220> 

<221> miscjeature 
<222> (1073) 

<223> n=c or a. Xaa=Leu or lie 

10 <220> 

<221> miscjeature 
<222> (2369) 

<223> n=c or a, Xaa=Leu or He 
<:400>5 

15 gggcgaattg ggcccsacgt cgcatgctcc cggccgccat ggcggccgcg ggaattcgat 60 

tccatcctaa tacgactcac tatagggctc gagcggccgc ccgggcaggt ctgaggtgca 120 

aggaccagga ctagggcgag ggcagcggtc caagaaatag aaaaca atg act ggg 175 

Het Thr Gly 
1 

20 aga gcc cga gcc aga gcc aga gga agg gcc cgc ggt cag gag aca gcg 223 
Arg Ala Arg Ala Arg Ala Arg Gly Arg Ala Arg Gly Gin Glu Thr Ala 
5 10 15 

cag ctg gtg ggc tec act gcc agt cag caa cct ggt tat att cag cct 271 
Gin Leu Val Gly Ser Thr Ala Ser Gin Gin Pro Gly Tyr He Gin Pro 
25 20 25 30 35 



PCT/US99y2S764 
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agg cct cag ccg cca cca gca gag ggg gaa tta ttt ggc cgt gga egg 



319 
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Arg Pro Gin Pro Pro Pro Ala Glu Gly Glu Leu Phe Gly Arg Gly Arg 
40 45 50 



cag aga gga aca gca gga gga aca gcc aag tea caa gga etc cag ata 
Gin Arg Gly Thr Ala Gly Gly Thr Ala Lys Ser Gin Gly Leu Gin He 
55 60 6S 



367 



tct get gga ttt cag gag tta teg nta gca gag aga gga ggt cgt egt 
Ser Ala Gly Phe Gin Glu Leu Ser Xaa Ala Glu Arg Gly Gly Arg Arg 
70 75 80 



415 



aga gat ttt cat gat ctt ggt gtg aat aca agg cag aac eta gac cat 
10 Arg Asp Phe His Asp Leu Gly VaL Asn Thr Arg Gin Asn Leu Asp His 
85 90 95 



463 



gtt aaa gaa tea aaa aca ggt tct tea ggc att ata gta agg tta age 
Val Lys Glu Ser Lys Thr Gly Ser Ser Gly lie lie Val Arg Leu Ser 
100 105 110 115 



511 



15 act aac cat ttc egg ctg aca tec egt ecc cag tgg gee tta tat cag 
Thr Asn His Phe Arg Leu Thr Ser Arg Pro Gin Trp Ala Leu Tyr Gin 
120 125 130 



559 



20 



tat cac att gac tat aac cca ctg atg gaa gcc aga aga etc egt tea 
Tyr His lie Asp Tyr Asn Pro Leu Met Glu Ala Arg Arg Leu Arg Ser 
135 140 145 



607 



get ctt ctt ttt caa cac gaa gat eta att gga aag tgt eat get ttt 
Ala Leu Leu Phe Gin His Glu Asp Leu lie Gly Lys Cys His Ala Phe 
150 155 160 



655 



gat gga acg ata tta ttt tta cct aaa aga eta eag caa aag gtt act 
25 Asp Gly Thr lie Leu Phe Leu Pro Lys Arg Leu Gin Gin Lys Val Thr 
165 170 175 



703 



gaa gtt ttt agt aag occ egg aat gga gag gat gtg agg ata acg ate 
Glu Val Phe Ser Lys Thr Arg Asn Gly Glu Asp Val Arg He Thr tie 
180 185 190 195 



751 



30 act tta aca aat gaa ctt cca cct aca tea cca act tgt ttg cag ttc 

Thr Leu Thr Asn Glu Leu Pro Pro Thr Ser Pro Thr Cys Leu Gin Phe 
200 205 210 



799 



tat aat att att ttc agg agg ctt ttg aaa ate atg aat ttg caa caa 
Tyr Asn lie lie Phe Arg Arg Leu Leu Lys lie Het Asn Leu Gin Gin 
35 215 220 225 

att gga cga aat tat tat aac cca aat gac cca att gat att cca agt 
lie Gly Arg Asn Tyr Tyr Asn Pro Asn Asp Pro He Asp He Pro Ser 
230 235 240 



847 



895 
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cac a99 ttg gtg att tgg cct ggc ttc act act tec ate ctt cag tat 943 
His Arg Leu Val Ue Trp Pro Gly Phe Thr Thr Ser lie Leu Gin Tyr 
245 250 255 

gaa aac ago ate atg etc tge act gac gtt age cat aaa gtc ctt cga 991 
Glu Asn Ser Ue Het Leu Cys Thr Asp Val Ser His Lys Val Leu Arg 
260 265 270 275 

agt gag act gtt ttg gat ttc atg ttc aac ttt tat cat cag aca gaa 1039 
Ser Glu Thr Val Leu Asp Phe Met Phe Asn Phe Tyr His Gin Thr Glu 
280 285 290 

gaa cat aaa ttt caa gaa caa gtt tec aaa gaa nta ata ggt tta gtt 1087 
Glu His Lys Phe Gin Glu Gin Val Ser Lys Glu Xaa He Gly Leu Val 
29S 300 305 

gtt ctt aec aag tat aac att aag aca tac aga gtg gat gat att gac 1135 
Val Leu Thr Lys Tyr Asn lie Lys Thr Tyr Arg Val Asp Asp He Asp 
310 315 320 

tgg gac cag aat ccc aag age acc ttt aag aaa gcc gac ggc tot ggg 1183 
Trp Asp Gin Asn Pro Lys Ser Thr Phe Lys Lys Ala Asp Gly Ser Gly 
325 330 335 

gtc age ttc tta gaa tac tac agg aag caa tac aac caa gag ate acc 1231 
Val Ser Phe Leu Glu Tyr Tyr Arg Lys Gin Tyr Asn Gin Glu lie Thr 
340 345 350 355 

gac ttg aag cag cct gtc ttg gtc age cag ccc aag aga agg egg ggc 1279 
Asp Leu Lys Gin Pro Val Leu Val Ser Gin Pro Lys Arg Arg Arg Gly 
360 365 370 

cct ggg ggg aca ctg oca ggg cct gcc atg etc att cct gag etc tgc 1327 
Pro Gly Gly Thr Leu Pro Gly Pro Ala Het Leu lie Pro Glu Leu Cys 
375 380 385 

tat ctt aca ggt eta act gat aaa atg cgt aat gat ttt aac gtg atg 1375 
Tyr Leu Thr Gly Leu Thr Asp Lys Met Arg Asn Asp Phe Asn Val Het 
390 395 400 

aaa gac tta gcc gtt cat aca aga eta act cca gag caa agg cag cgt 1423 
Lys Asp Leu Ala Val His Thr Arg Leu Thr Pro Glu Gin Arg Gin Arg 
405 410 415 

gaa gtg gga cga etc att gat tac att cat aaa aac gat aat gtt caa 1471 
Glu Val Gly Arg Leu Ue Asp Tyr Ue His Lys Asn Asp Asn Val GLn 
420 425 430 435 

agg gag ctt cga gac tgg ggt ttg age ttt gat tec aac tta ctg tec 1519 
Arg Glu Leu Arg Asp Trp Gly Leu Ser Phe Asp Ser Asn Leu Leu Ser 
440 445 450 
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ttc tea gga aga att tt9 caa aca gaa aag att cac caa ggt gga aaa 1567 
Phe Ser 6ly Arg lie Leu Gin Thr Glu Lys lie His Gin Gly Gly Lys 
455 460 465 

aca ttt gat tac aat cca caa ttt gca gat tgg tec aaa gaa aca aga 1615 
5 Thr Phe Asp Tyr Asn Pro Gin Phe Ala Asp Trp Ser Lys Glu Thr Arg 
470 480 

ggt gca cca tta att agt gtt aag cca eta gat aac tgg ctg ttg ate 1663 
Gly Ala Pro Leu lie Ser Val Lys Pro Leu Asp Asn Trp Leu Leu lie 
485 490 , 495 

10 tat aeg ega aga aat tat gaa gca gee aat tea ttg ata caa aat eta 1711 
Tyr Thr Arg Arg Asn Tyr Glu Ala Ala Asn Ser Leu lie Gin Asn Leu 
500 505 510 515 

ttt aaa gtt aca cca gee atg ggc atg caa atg aga aaa gca ata atg 1759 
Phe Lys Val Thr Pro Ala Net Gly Net Gin Met Arg Lys Ala lie Met 
15 520 525 530 

att gaa gtg gat gac aga act gaa gcc tac tta aga gtc tta cag caa 1807 
lie Glu Val Asp Asp Arg Thr Glu Ala Tyr Leu Arg Val Leu Gin Gin 
535 540 545 

aag gtc aca gca gac ace cag ata gtt gtc tgt ctg ttg tea agt aat 1855 
20 Lys val Thr Ata Asp Thr Gin lie Val Val Cys Leu Leu Ser Ser Asn 
550 555 560 

egg aag gac aaa tac gat get att aaa aaa tac ctg tgt aca gat tge 1903 
Arg Lys Asp Lys Tyr Asp Ala lie Lys Lys Tyr Leu Cys Thr Aap Cys 
565 570 575 

25 cet acc cca agt cag tgt gtg gtg gcc cga ace tta ggc aaa cag caa 1951 

Pro Thr Pro Ser Gin Cys Val Val Ala Arg Thr Leu Gly Lys Gin Gin 
580 5^ 590 595 

act gtc atg gcc att get aca aag att gcc eta cag atg aac tgc aag 1999 
Thr Val Net Ala He Ala Thr Lys lie Ala Leu Gin Net Asn Cys Lys 
30 600 605 610 

atg gga gga gag etc tgg agg gtg gac ate ccc ctg aag etc gtg atg 2047 
Net Gly Gly Glu Leu Trp Arg Val Asp He Pro Leu Lys Leu Val Met 
615 620 625 

ate gtt ggc ate gat tgt tac cat gac atg aca get ggg egg agg tea 2095 
35 He Val Gly lie Asp Cys Tyr His Asp Net Thr Ala Gly Arg Arg Ser 

630 635 640 

ate gca gga ttt gtt gcc age ate aat gaa ggg atg acc cgc tgg ttc 2143 
tie Ala Gly Phe Val Ala Ser lie Asn Glu Gly Net Thr Arg Trp Phe 
645 650 655 
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tea cgc tgc ata ttt cag gat aga gga cag gag ctg gta gat ggg etc 2191 
Ser Arg Cys He phe Gin Asp Arg Gly Gin Glu Leu Val Asp Gly Leu 
660 665 670 675 

aaa gtc tgc ctg caa gcg get ctg agg get tgg aat age tgc aat gag 2239 
tys Val Cys Leu Gin Ala Ala Leu Arg Ala Trp Asn Ser Cys Asn Glu 
680 685 690 

tac atg ccc age egg ate ate gtg tae ege gat ggc gta gga gae ggc 2287 
Tyr Het Pro Ser Arg lie He Val Tyr Arg Asp Gly Val Gly Asp Gly 
695 700 705 

cag ctg aaa aca ctg gtg aac tac gaa gtg cca cag ttt ttg gat tgt 2335 
Gin Leu Lys Thr Leu Val Asn Tyr Glu Val Pro Gin Phe Leu Asp Cys 
710 715 720 

eta aaa tec att ggt aga ggt tae aac cct aga nta acg gta att gtg 2383 
Leu Lys Ser He Gly Arg Gly Tyr Asn Pro Arg Xaa Thr Val lie Val 
725 730 735 

gtg aag aaa aga gtg aae ace aga ttt ttt get eag tet gga gga aga 2431 
Val Lys Lys Arg Val Asn Thr Arg Phe Phe Ala Gin Ser Gly Gly Arg 
740 745 750 755 

ett cag aat cca ctt cct gga aca gtt att gat gta gag gtt ace aga 2479 
Leu Gin Asn Pro Leu Pro Gly Thr Val He Asp Val Glu Val Thr Arg 
760 765 770 

cca gaa tgg tat gae ttt ttt ate gtg age cag get gtg aga agt ggt 2527 
Pro Glu Trp Tyr Asp Phe Phe He Val Ser Gin Ala Val Arg Ser Gly 
775 780 785 

agt gtt tet ccc aca cat tae aat gtc ate tat gae aac age ggc ctg 2575 
Ser Val Ser Pro Thr His Tyr Asn Val He Tyr Asp Asn Ser Gly Leu 
790 795 800 

aag cca gae eac ata eag cgc ttg acc tac aag ctg tgc cae ate tat 2623 
Lys Pro Asp His He Gin Arg Leu Thr Tyr Lys Leu Cys Hfs He Tyr 
805 810 815 

tac aac tgg cca ggt gtc att cgt gtt cct get cct tgc cag tac gee 2671 
Tyr Asn Trp Pro Gly Val He Arg Val Pro Ala Pro Cys Gin Tyr Ala 
820 825 830 835 

cae aag ctg get ttt ctt gtt ggc cag agt att eac aga gag cca aat 2719 
His Lys Leu Ala Phe Leu Val Gly Gin Ser He His Arg Glu Pro Asn 
840 845 850 

ctg tea ctg tea aac cgc ctt tae tac etc taacctgcag aagacgatgc 2769 
Leu Ser Leu Ser Asn Arg Leu Tyr Tyr Leu 
855 860 
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agccgctttt ctttttgaaa tgactttggg 
taactgttat ctttctggat gaaacttggg 
ctagcattgc tattcaccgg cttccttatt 
atcttcttgt ttctcataga tattttgtga 
gataagatac ttggtagtat aaaacagact 
tttacttaaa cgtactttca ggagtgagca 
tttttgagat acctgttttg aatttaaagg 
acaaccatag gtggggtttc agctcatatc 
atacacaaga tacaggagaa aatatgcttg 
tgggagtaaa aaaaacattg aaaattttta 
taacaaaaaa ggccatgagt aaatctctat 
ctgggacatg attctatttg ttataaaata 

<210>6 
<211>861 
<212> PRT 
<213> Homo salens 
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atttttttaa gcttttattt actttttttt 2829 
aaggggatta ggagatctag cattttattt 2889 
ttatatgtaa aaattaagat tttatatttt 2949 
gcattttttt gtttattttg aagaaatgtg 3009 
ctctgagagt atttgaaatg tgtttggaga 3069 
agtcctactt ataaacctat attaacttta 3129 
agataagagg cgtaaagtag gatgctcact 3189 
ttaaagataa aaggtactat tatataacct 3249 
atttttattt ggcagggggg ctaggttgta 3309 
aattgtccaa agaaacattt taagactctt 3369 
attaacatca ctatttattt tgttttggaa 3429 
aaattgatgt ecc 3472 



<400> 6 

Nat Thr Gly Arg Ala Arg Ala Arg Ala Arg Gly Arg Ala Arg Gly Gin 
15 10 15 

Glu ThP Ala Gin Leu Val Gly Ser Thr Ala Ser Gin Gin Pro Gly Tyr 
20 25 30 

lie Gin Pro Arg Pro Gin Pro Pro Pro Ala Glu Gly Glu leu Phe Gly 
35 40 45 

Arg Gly Arg Gin Arg Gly Thr Ala Gly Gly Thr Ala Lys Ser Gin Gly 
50 55 60 



Leu Gin lie Ser Ala Gly Phe Gin Glu Leu Ser Xaa Ala Glu Arg Gly 
65 70 75 80 
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Giy Arg Arg Arg Asp Phe His Asp Leu Gly Val Asn Thr Arg Gin Asn 
8S 90 95 

Leu Asp His Val Lys Glu Ser Lys Thr Gly Ser Ser Gly lie He Val 
100 105 110 

Arg Leu Ser Thr Asn His Phe Arg Leu Thr Ser Arg Pro Gin Trp Ala 
115 120 125 

Leu Tyr Gin Tyr His He Asp Tyr Asn Pro Leu Het Glu Ala Arg Arg 
130 135 140 

Leu Arg Ser Ala Leu Leu Phe Gin His Glu Asp Leu lie Gly Lys Cys 
145 150 155 160 

His Ala Phe Asp Gly Thr lie Leu Phe Leu Pro Lys Arg Leu Gin Gin 
165 170 175 

Lys Val Thr Glu Val Phe Ser Lys Thr Arg Asn Gly Glu Asp Val Arg 
180 185 190 

I le Thr J le Thr Leu Thr Asn Glu Leu Pro Pro Thr Ser Pro Thr Cys 
195 200 205 

Leu Gin Phe Tyr Asn lie lie Phe Arg Arg Leu Leu Lys lie Met Asn 
210 215 220 

Leu Gin Gin He Gly Arg Asn Tyr Tyr Asn Pro Asn Asp Pro He Asp 
225 230 235 240 

He Pro Ser His Arg Leu Val He Trp Pro Gly Phe Thr Thr Ser He 
245 250 255 

Leu Gin Tyr Glu Asn Ser He Het Leu Cys Thr Asp Val Ser His Lys 
260 265 270 

Val Leu Arg Ser Glu Thr Val Leu Asp Phe Het Phe Asn Phe Tyr His 
275 280 2» 

Gin Thr Glu Glu His Lys Phe Gin Glu Gin Val Ser Lys Glu Xaa He 
290 295 300 

Gly Leu Val Val Leu Thr Lys Tyr Asn He Lys Thr Tyr Arg Val Asp 
305 310 315 320 



Asp He Asp Trp Asp Gin Asn Pro Lys Ser Thr Phe Lys Lys Ala Asp 
325 330 335 



Gly Ser Gly Val Ser Phe Leu Glu Tyr Tyr Arg Lys Gin Tyr Asn Gin 
340 345 350 
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Glu Ue Thr Asp leu Lys Gin Pro Val Leu Vat Ser Gin Pro lys Arg 
355 360 365 

Arg Arg Gly Pro Gly Gly Thr Leu Pro Gly Pro Ala Het Leu He Pro 
370 375 380 

Glu Leu Cys Tyr Leu Thr Gly Leu Thr Asp Lys Het Arg Asn Asp Phe 
385 390 395 AOO 

Asn Val Met Lys Asp Leu Ala Val His Thr Arg Leu Thr Pro Glu Gin 
405 410 415 

Arg Gin Arg Glu Val Gly Arg Leu He Asp Tyr lie His Lys Asn Asp 
420 425 430 

Asn Val Gin Arg Glu Leu Arg Asp Trp Gly Leu Ser Phe Asp Ser Asn 
435 440 445 

Leu Leu Ser Phe Ser Gly Arg He Leu Gin Thr Glu Lys He His Gin 
450 455 460 

Gly Gly Lys Thr Phe Asp Tyr Asn Pro Gin Phe Ala Asp Trp Ser Lys 
465 470 475 480 

Glu Thr Arg Gly Ala Pro Leu He Ser Val Lys Pro Leu Asp Asn Trp 
485 490 495 

Leu Leu He Tyr Thr Arg Arg Asn Tyr Glu Ala Ala Asn Ser Leu He 
500 505 510 

Gin Asn Leu Phe Lys Val Thr Pro Ala Met Gly Het Gin Het Arg Lys 
515 520 525 

Ala He Net He Glu Val Asp Asp Arg Thr Glu Ala Tyr Leu Arg Val 
530 535 540 

Leu Gin Gin Lys Val Thr Ala Asp Thr Gin He Val Val Cys Leu Leu 
545 550 555 560 

Ser Ser Asn Arg Lys Asp Lys Tyr Asp Ala He Lys Lys Tyr Leu Cys 
565 570 575 

Thr Asp Cys Pro Thr Pro Ser Gin Cys Val Val Ala Arg Thr Leu Gly 
580 585 590 



Lys Gin Gin Thr Val Net Ala He Ala Thr Lys He Ala Leu Gin Net 
595 600 605 



Asn Cys Lys Net Gly Gly Glu Leu Trp Arg Val Asp He Pro Leu Lys 
610 615 620 



wo 00/32039 



PCT/US99/28764 



-28- 



Leu Val Net He Val Gly lie Asp Cys Tyr His Asp Met Thr Ala Gly 
625 630 635 640 

Ars Arg Ser lie Ala Gly Phe Val Ala Ser He Asn Glu Gly Met Thr 
645 650 655 

Arg Trp Phe Ser Arg Cys He Phe Gin Asp Arg Gly Gin Glu Leu Val 
660 665 670 

Asp Gly Leu Lys Val Cys Leu Gin Ala Ala Leu Arg Ala Trp Asn Ser 
675 680 685 

Cys Asn Glu Tyr Met Pro Ser Arg He He Val Tyr Arg Asp Gly Val 
690 695 7D0 

Gly Asp Gly Gin Leu Lys Thr Leu Val Asn Tyr Glu Val Pro Gin Phe 
705 710 715 720 

Leu Asp Cys Leu Lys Ser He GLy Arg Gly Tyr Asn Pro Arg Xaa Thr 
725 730 735 

Val He Val Val Lys Lys Arg Val Asn Thr Arg Phe Phe Ala Gin Ser 
740 745 750 

Gly Gly Arg Leu Gin Asn Pro Leu Pro Gly Thr Val I le Asp Val Glu 
755 760 765 

Val Thr Arg Pro Glu Trp Tyr Asp Phe Phe He Val Ser Gin Ala Val 
770 775 780 

Arg Ser Gly Ser Val Ser Pro Thr His Tyr Asn Val He Tyr Asp Asn 
785 790 795 800 

Ser Gly Leu Lys Pro Asp His He Gin Arg Leu Thr Tyr Lys Leu Cys 
805 810 815 

His He Tyr Tyr Asn Trp Pro Gly Val He Arg Val Pro Ala Pro Cys 
820 825 830 

Gin Tyr Ala His Lys Leu Ala Phe Leu Val Gly Gin Ser He His Arg 
835 840 845 



Glu Pro Asn Leu Ser Leu Ser Asn Arg Leu Tyr Tyr Leu 
850 855 860 



<210>7 
<211>42 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthesized 
Oligonucleotide Primer 

5 <400> 7 

gatcatatgg agcsaaaget tattagcgag gaaoatctga at 42 

<210> 8 
<211>42 

<212> Dl^ 
1 0 <21 3> Artifidal Sequence 
<220> 

<223> Description of Artificial Sequence: Syntliesized 
Oligonucleotide Primer 

<400>8 

15 gatcattcag atcttcctcg ctaataagct tttgctccat at 42 

<210> 9 
<211>45 
<212> DNA 

<213> Artificial Sequence 
20 <220> 

<223> Description of Artificial Sequence: Syntliesized 
Oligonucleotide Primer 



<400> 9 



wo 00/32039 
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-30- 

ctagcatatg agcaaaagct tattagcgag gaagatctga ataag 45 

<210> 10 
<211>45 
<212> DNA 
5 <213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Syntliesized 
Oligonucleotide Primer 

<400> 10 

10 ctagcttatt cagatcttcc tcgctaataa gcttttgctc atatg 45 

<210>11 
<211> 17 
<212> DNA 

<21 3> Artificial Sequence 
15 <220> 

<223> Description of Artificial Sequence: Synthesized 
Oligonucleotide Primer 

<400>11 



acgataagtt ctgttat 
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